2025-12-04T09:21:40.7900703Z Current runner version: '2.330.0' 2025-12-04T09:21:40.7904986Z Runner name: 'ctr-navi3x-b13-ws9' 2025-12-04T09:21:40.7905591Z Runner group name: 'linux.rocm.gpu.group' 2025-12-04T09:21:40.7906265Z Machine name: 'ctr-navi3x-b13-ws9' 2025-12-04T09:21:40.7908253Z ##[group]GITHUB_TOKEN Permissions 2025-12-04T09:21:40.7909735Z Contents: read 2025-12-04T09:21:40.7910132Z Metadata: read 2025-12-04T09:21:40.7910549Z ##[endgroup] 2025-12-04T09:21:40.7912115Z Secret source: Actions 2025-12-04T09:21:40.7912592Z Prepare workflow directory 2025-12-04T09:21:41.0832640Z Prepare all required actions 2025-12-04T09:21:41.0865632Z Getting action download info 2025-12-04T09:21:41.4344400Z Download action repository 'pytorch/pytorch@main' (SHA:7716da9fb23f27a65b41f9f016a2afadf281c18f) 2025-12-04T09:21:48.6070717Z Download action repository 'pytorch/test-infra@main' (SHA:39aa74d619174326f4e2fb0e216151c2f29d9ffd) 2025-12-04T09:21:49.5303328Z Download action repository 'actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-12-04T09:21:50.1831118Z Download action repository 'aws-actions/configure-aws-credentials@ececac1a45f3b08a01d2dd070d28d111c5fe6722' (SHA:ececac1a45f3b08a01d2dd070d28d111c5fe6722) 2025-12-04T09:21:50.8939379Z Getting action download info 2025-12-04T09:21:51.0430982Z Download action repository 'actions/checkout@v4' (SHA:34e114876b0b11c390a56381ad16ebd13914f8d5) 2025-12-04T09:21:52.1277307Z Getting action download info 2025-12-04T09:21:52.2935596Z Download action repository 'nick-fields/retry@v3.0.0' (SHA:7152eba30c6575329ac0576536151aca5a72780e) 2025-12-04T09:21:52.7907379Z Getting action download info 2025-12-04T09:21:52.9977183Z Uses: pytorch/pytorch/.github/workflows/_rocm-test.yml@refs/heads/main (ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32) 2025-12-04T09:21:52.9980044Z ##[group] Inputs 2025-12-04T09:21:52.9980253Z build-environment: linux-jammy-rocm-py3.10 2025-12-04T09:21:52.9980744Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 2, "runner": "linux.rocm.gpu.gfx1100"}, {"config": "default", "shard": 2, "num_shards": 2, "runner": "linux.rocm.gpu.gfx1100"}]} 2025-12-04T09:21:52.9981459Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:21:52.9981888Z sync-tag: 2025-12-04T09:21:52.9982386Z timeout-minutes: 300 2025-12-04T09:21:52.9983277Z tests-to-include: test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor inductor/test_kernel_benchmark inductor/test_pad_mm inductor/test_benchmark_fusion inductor/test_aot_inductor inductor/test_torchinductor inductor/test_decompose_mem_bound_mm inductor/test_flex_attention inductor/test_max_autotune 2025-12-04T09:21:52.9984155Z dashboard-tag: 2025-12-04T09:21:52.9984515Z disable-monitor: true 2025-12-04T09:21:52.9984682Z monitor-log-interval: 5 2025-12-04T09:21:52.9984857Z monitor-data-collect-interval: 1 2025-12-04T09:21:52.9985040Z ##[endgroup] 2025-12-04T09:21:52.9985278Z Complete job name: linux-jammy-rocm-py3_10 / test (default, 2, 2, linux.rocm.gpu.gfx1100) 2025-12-04T09:21:53.0738521Z ##[group]Run pytorch/pytorch/.github/actions/checkout-pytorch@main 2025-12-04T09:21:53.0738985Z with: 2025-12-04T09:21:53.0739112Z no-sudo: true 2025-12-04T09:21:53.0739253Z submodules: recursive 2025-12-04T09:21:53.0739398Z fetch-depth: 0 2025-12-04T09:21:53.0739630Z env: 2025-12-04T09:21:53.0739751Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:21:53.0739902Z ##[endgroup] 2025-12-04T09:21:53.0796971Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T09:21:53.0797551Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T09:21:53.0811794Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:21:53.0812229Z env: 2025-12-04T09:21:53.0812351Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:21:53.0812504Z ##[endgroup] 2025-12-04T09:21:53.0968417Z ##[group]Run # Use all available CPUs for fetching 2025-12-04T09:21:53.0968701Z # Use all available CPUs for fetching 2025-12-04T09:21:53.0968903Z cd "${GITHUB_WORKSPACE}" 2025-12-04T09:21:53.0969098Z git config --global fetch.parallel 0 2025-12-04T09:21:53.0969317Z git config --global submodule.fetchJobs 0 2025-12-04T09:21:53.0969511Z  2025-12-04T09:21:53.0969721Z # Clean workspace. The default checkout action should also do this, but 2025-12-04T09:21:53.0969987Z # do it here as well just in case 2025-12-04T09:21:53.0970172Z if [[ -d .git ]]; then 2025-12-04T09:21:53.0970339Z  if [ -z "${NO_SUDO}" ]; then 2025-12-04T09:21:53.0970511Z  sudo git clean -ffdx 2025-12-04T09:21:53.0970677Z  else 2025-12-04T09:21:53.0970807Z  git clean -ffdx 2025-12-04T09:21:53.0970980Z  fi 2025-12-04T09:21:53.0971108Z fi 2025-12-04T09:21:53.0982344Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:21:53.0982562Z env: 2025-12-04T09:21:53.0982687Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:21:53.0982839Z NO_SUDO: true 2025-12-04T09:21:53.0983030Z ##[endgroup] 2025-12-04T09:21:53.4565564Z Removing .additional_ci_files/ 2025-12-04T09:21:53.4566144Z Removing build/ 2025-12-04T09:21:53.4566610Z Removing dist/ 2025-12-04T09:21:53.4566956Z Removing test/test-reports/ 2025-12-04T09:21:53.4676198Z ##[group]Run actions/checkout@v4 2025-12-04T09:21:53.4676644Z with: 2025-12-04T09:21:53.4677013Z ref: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:21:53.4677486Z fetch-depth: 0 2025-12-04T09:21:53.4677829Z submodules: recursive 2025-12-04T09:21:53.4678179Z show-progress: false 2025-12-04T09:21:53.4678576Z repository: pytorch/pytorch 2025-12-04T09:21:53.4679201Z token: *** 2025-12-04T09:21:53.4679516Z ssh-strict: true 2025-12-04T09:21:53.4679863Z ssh-user: git 2025-12-04T09:21:53.4680208Z persist-credentials: true 2025-12-04T09:21:53.4680575Z clean: true 2025-12-04T09:21:53.4680924Z sparse-checkout-cone-mode: true 2025-12-04T09:21:53.4681332Z fetch-tags: false 2025-12-04T09:21:53.4681650Z lfs: false 2025-12-04T09:21:53.4681967Z set-safe-directory: true 2025-12-04T09:21:53.4682315Z env: 2025-12-04T09:21:53.4682615Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:21:53.4682962Z ##[endgroup] 2025-12-04T09:21:53.5526323Z Syncing repository: pytorch/pytorch 2025-12-04T09:21:53.5528141Z ##[group]Getting Git version info 2025-12-04T09:21:53.5528762Z Working directory is '/home/pytorchci/actions-runner/_work/pytorch/pytorch' 2025-12-04T09:21:53.5529584Z [command]/usr/bin/git version 2025-12-04T09:21:53.5529919Z git version 2.34.1 2025-12-04T09:21:53.5531017Z ##[endgroup] 2025-12-04T09:21:53.5535908Z Copying '/home/pytorchci/.gitconfig' to '/home/pytorchci/actions-runner/_work/_temp/e288b110-cd1a-444a-9733-2a163bda2758/.gitconfig' 2025-12-04T09:21:53.5538110Z Temporarily overriding HOME='/home/pytorchci/actions-runner/_work/_temp/e288b110-cd1a-444a-9733-2a163bda2758' before making global git config changes 2025-12-04T09:21:53.5539460Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T09:21:53.5540569Z [command]/usr/bin/git config --global --add safe.directory /home/pytorchci/actions-runner/_work/pytorch/pytorch 2025-12-04T09:21:53.5549524Z [command]/usr/bin/git config --local --get remote.origin.url 2025-12-04T09:21:53.5576862Z https://github.com/pytorch/pytorch 2025-12-04T09:21:53.5592102Z ##[group]Removing previously created refs, to avoid conflicts 2025-12-04T09:21:53.5595609Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-12-04T09:21:53.5620336Z HEAD 2025-12-04T09:21:53.5651448Z ##[endgroup] 2025-12-04T09:21:53.5653444Z [command]/usr/bin/git submodule status 2025-12-04T09:21:53.5953826Z 7e1e1fe3858c63c251c637ae41a20de425dde96f android/libs/fbjni (v0.1.0-12-g7e1e1fe) 2025-12-04T09:21:53.6053502Z 4dfe081cf6bcd15db339cf2680b9281b8451eeb3 third_party/FP16 (4dfe081) 2025-12-04T09:21:53.6144420Z b408327ac2a15ec3e43352421954f5b1967701d1 third_party/FXdiv (b408327) 2025-12-04T09:21:53.6244454Z c07e3a0400713d546e0dea2d5466dd22ea389c73 third_party/NNPACK (c07e3a0) 2025-12-04T09:21:53.6293380Z 3ebbc93ded7285963bff932c678fa367eb393ba6 third_party/NVTX (v3.1.0-313-g3ebbc93) 2025-12-04T09:21:53.6369466Z 1d8f600fd424278486eade7ed3e877c99f0846b1 third_party/VulkanMemoryAllocator (v2.1.0-982-g1d8f600) 2025-12-04T09:21:53.6781668Z 51a0103656eff6fc9bfd39a4597923c4b542c883 third_party/XNNPACK (remotes/origin/ds/ndk-1243-g51a0103656) 2025-12-04T09:21:53.6819194Z 01aae101b9e5e94d6c16a9514c9fb8df99c93150 third_party/aiter (v0.1.1-92-g01aae101) 2025-12-04T09:21:53.6844466Z 299e5928955cc62af9968370293b916f5130916f third_party/benchmark (v1.9.3) 2025-12-04T09:21:53.6914873Z 7fe50dc3da2069d6645d9deb8c017a876472a977 third_party/composable_kernel (rocm-6.4.3-459-g7fe50dc3d) 2025-12-04T09:21:53.7032451Z 89c932f313c6437c38f2982869beacc89c2f2246 third_party/cpp-httplib (v0.26.0) 2025-12-04T09:21:53.7156430Z f858c30bcb16f8effd5ff46996f0514539e17abc third_party/cpuinfo (f858c30) 2025-12-04T09:21:53.7195336Z 0b1577c8c83401237d601d0d0db5210506705396 third_party/cudnn_frontend (v0.5-61-g0b1577c) 2025-12-04T09:21:53.7314062Z e67e63c331d6e4b729047c95cf6b92c8454cba89 third_party/cutlass (v4.3.0) 2025-12-04T09:21:53.7350117Z c0b988d39a9e47c794d699f29930ed4d7c7e13a4 third_party/fbgemm (v1.4.0-rc1-2-gc0b988d39) 2025-12-04T09:21:53.7442509Z 979702c87a8713a8e0a5e9fee122b90d2ef13be5 third_party/flash-attention (v2.7.4) 2025-12-04T09:21:53.7468627Z a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757 third_party/flatbuffers (v24.12.23) 2025-12-04T09:21:53.7783652Z 407c905e45ad75fc29bf0f9bb7c5c2fd3475976f third_party/fmt (12.1.0) 2025-12-04T09:21:53.7887643Z 3fb5c176c17c765a3492cd2f0321b0dab712f350 third_party/gemmlowp/gemmlowp (remotes/origin/revert-87-master-135-g3fb5c17) 2025-12-04T09:21:53.8018049Z 54cbae0d3a67fa890b4c3d9ee162b7860315e341 third_party/gloo (remotes/origin/gh/c-p-i-o/1/base-37-g54cbae0) 2025-12-04T09:21:53.8192260Z 52eb8108c5bdec04579160ae17225d66034bd723 third_party/googletest (release-1.8.0-3544-g52eb8108) 2025-12-04T09:21:53.8267792Z 719d8e6cd7f7a0e01b155657526d693acf97c2b3 third_party/ideep (pytorch-rls-v3.7.1) 2025-12-04T09:21:53.8341740Z dec1d23ca65ab069d225dfe40dea14f455170959 third_party/ittapi (v3.25.5) 2025-12-04T09:21:53.8529061Z 31f85df8fbd89c188f14ef10f1ec65379786b943 third_party/kineto (heads/main) 2025-12-04T09:21:53.8560944Z d7770c89632329a9914ef1a90289917597639cbe third_party/kleidiai (v1.15.0) 2025-12-04T09:21:53.8590506Z fbd8b99c2b828428947d70fdc046bb55609be93e third_party/mimalloc (v2.2.4) 2025-12-04T09:21:53.8633551Z 55f93686c01528224f448c19128836e7df245f72 third_party/nlohmann (v3.12.0) 2025-12-04T09:21:53.8889841Z e709452ef2bbc1d113faf678c24e6d3467696e83 third_party/onnx (v1.18.0) 2025-12-04T09:21:53.8917118Z a799f4aed9c94b765dcdaabaeab7d5e7e2310878 third_party/opentelemetry-cpp (v1.14.2) 2025-12-04T09:21:53.8954183Z 0fa0ef591e38c2758e3184c6c23e497b9f732ffa third_party/pocketfft (release_for_eigen-40-g0fa0ef5) 2025-12-04T09:21:53.9245426Z d1eca4e4b421cd2997495c4b4e65cea6be4e9b8a third_party/protobuf (v3.7.0-rc.2-1279-gd1eca4e4b) 2025-12-04T09:21:53.9347615Z 072586a71b55b7f8c584153d223e95687148a900 third_party/psimd (heads/master) 2025-12-04T09:21:53.9424863Z 4fe0e1e183925bf8cfa6aae24237e724a96479b8 third_party/pthreadpool (0.1-144-g4fe0e1e) 2025-12-04T09:21:53.9458151Z f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8 third_party/pybind11 (v3.0.1) 2025-12-04T09:21:53.9559178Z f45429b087dd7d5bc78bb40dc7cf06425c252d67 third_party/python-peachpy (remotes/origin/pre-generated) 2025-12-04T09:21:53.9637465Z 5a1d179df9cf652951b59010a2d2075372d67f68 third_party/sleef (3.8) 2025-12-04T09:21:53.9724919Z 2b4cd91092d335a697416b2a3cb398283246849d third_party/tensorpipe (heads/main) 2025-12-04T09:21:53.9739958Z ##[group]Cleaning the repository 2025-12-04T09:21:53.9744167Z [command]/usr/bin/git clean -ffdx 2025-12-04T09:21:53.9995244Z [command]/usr/bin/git reset --hard HEAD 2025-12-04T09:21:54.0898057Z HEAD is now at 135f3753c41 [xpu][feature] [3/3] Register the `scaled_mm` and `scaled_mm_v2` for xpu (#166056) 2025-12-04T09:21:54.0927424Z ##[endgroup] 2025-12-04T09:21:54.0929243Z ##[group]Disabling automatic garbage collection 2025-12-04T09:21:54.0934681Z [command]/usr/bin/git config --local gc.auto 0 2025-12-04T09:21:54.0959562Z ##[endgroup] 2025-12-04T09:21:54.0960097Z ##[group]Setting up auth 2025-12-04T09:21:54.0964345Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T09:21:54.1015445Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T09:21:54.1289106Z Entering 'android/libs/fbjni' 2025-12-04T09:21:54.1339405Z Entering 'third_party/FP16' 2025-12-04T09:21:54.1379114Z Entering 'third_party/FXdiv' 2025-12-04T09:21:54.1426205Z Entering 'third_party/NNPACK' 2025-12-04T09:21:54.1464067Z Entering 'third_party/NVTX' 2025-12-04T09:21:54.1523408Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:21:54.1578065Z Entering 'third_party/XNNPACK' 2025-12-04T09:21:54.1638078Z Entering 'third_party/aiter' 2025-12-04T09:21:54.1684017Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:21:54.1745141Z Entering 'third_party/benchmark' 2025-12-04T09:21:54.1793846Z Entering 'third_party/composable_kernel' 2025-12-04T09:21:54.1851279Z Entering 'third_party/cpp-httplib' 2025-12-04T09:21:54.1897227Z Entering 'third_party/cpuinfo' 2025-12-04T09:21:54.1947627Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:21:54.1984190Z Entering 'third_party/cutlass' 2025-12-04T09:21:54.2039735Z Entering 'third_party/fbgemm' 2025-12-04T09:21:54.2084371Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:21:54.2125466Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:21:54.2172663Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:21:54.2215139Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:21:54.2271187Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:21:54.2313363Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:21:54.2356309Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:21:54.2403545Z Entering 'third_party/flash-attention' 2025-12-04T09:21:54.2451645Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:21:54.2501987Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:21:54.2544737Z Entering 'third_party/flatbuffers' 2025-12-04T09:21:54.2604055Z Entering 'third_party/fmt' 2025-12-04T09:21:54.2652735Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:21:54.2684229Z Entering 'third_party/gloo' 2025-12-04T09:21:54.2733134Z Entering 'third_party/googletest' 2025-12-04T09:21:54.2783569Z Entering 'third_party/ideep' 2025-12-04T09:21:54.2826029Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:21:54.2876738Z Entering 'third_party/ittapi' 2025-12-04T09:21:54.2917023Z Entering 'third_party/kineto' 2025-12-04T09:21:54.2965746Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:21:54.3013320Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:21:54.3060101Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:21:54.3094694Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:21:54.3126627Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:21:54.3167512Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:21:54.3207872Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:21:54.3245184Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:21:54.3285353Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:21:54.3327613Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:21:54.3364890Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:21:54.3403012Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:21:54.3454561Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:21:54.3513475Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:21:54.3570439Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:21:54.3617674Z Entering 'third_party/kleidiai' 2025-12-04T09:21:54.3667282Z Entering 'third_party/mimalloc' 2025-12-04T09:21:54.3711729Z Entering 'third_party/nlohmann' 2025-12-04T09:21:54.3758345Z Entering 'third_party/onnx' 2025-12-04T09:21:54.3818851Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:21:54.3874866Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:21:54.3918447Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:21:54.3970570Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:21:54.4009062Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:21:54.4048890Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:21:54.4090750Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:21:54.4137553Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:21:54.4191358Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:21:54.4249036Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:21:54.4312067Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:21:54.4349030Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:21:54.4401571Z Entering 'third_party/pocketfft' 2025-12-04T09:21:54.4436664Z Entering 'third_party/protobuf' 2025-12-04T09:21:54.4470886Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:21:54.4514694Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:21:54.4553024Z Entering 'third_party/psimd' 2025-12-04T09:21:54.4598039Z Entering 'third_party/pthreadpool' 2025-12-04T09:21:54.4639439Z Entering 'third_party/pybind11' 2025-12-04T09:21:54.4682145Z Entering 'third_party/python-peachpy' 2025-12-04T09:21:54.4729668Z Entering 'third_party/sleef' 2025-12-04T09:21:54.4768827Z Entering 'third_party/tensorpipe' 2025-12-04T09:21:54.4810262Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:21:54.4850772Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:21:54.4885704Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:21:54.4918444Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:21:54.4962688Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:21:54.5026756Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T09:21:54.5052750Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T09:21:54.5282014Z Entering 'android/libs/fbjni' 2025-12-04T09:21:54.5329276Z Entering 'third_party/FP16' 2025-12-04T09:21:54.5367268Z Entering 'third_party/FXdiv' 2025-12-04T09:21:54.5409451Z Entering 'third_party/NNPACK' 2025-12-04T09:21:54.5453003Z Entering 'third_party/NVTX' 2025-12-04T09:21:54.5489370Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:21:54.5530092Z Entering 'third_party/XNNPACK' 2025-12-04T09:21:54.5586600Z Entering 'third_party/aiter' 2025-12-04T09:21:54.5619547Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:21:54.5666743Z Entering 'third_party/benchmark' 2025-12-04T09:21:54.5702185Z Entering 'third_party/composable_kernel' 2025-12-04T09:21:54.5749931Z Entering 'third_party/cpp-httplib' 2025-12-04T09:21:54.5788811Z Entering 'third_party/cpuinfo' 2025-12-04T09:21:54.5828893Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:21:54.5864351Z Entering 'third_party/cutlass' 2025-12-04T09:21:54.5901503Z Entering 'third_party/fbgemm' 2025-12-04T09:21:54.5945089Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:21:54.5981875Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:21:54.6020599Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:21:54.6064747Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:21:54.6102880Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:21:54.6141931Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:21:54.6174512Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:21:54.6213080Z Entering 'third_party/flash-attention' 2025-12-04T09:21:54.6255712Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:21:54.6309955Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:21:54.6354461Z Entering 'third_party/flatbuffers' 2025-12-04T09:21:54.6394208Z Entering 'third_party/fmt' 2025-12-04T09:21:54.6432273Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:21:54.6471873Z Entering 'third_party/gloo' 2025-12-04T09:21:54.6523817Z Entering 'third_party/googletest' 2025-12-04T09:21:54.6566342Z Entering 'third_party/ideep' 2025-12-04T09:21:54.6608945Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:21:54.6664088Z Entering 'third_party/ittapi' 2025-12-04T09:21:54.6707239Z Entering 'third_party/kineto' 2025-12-04T09:21:54.6743726Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:21:54.6781847Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:21:54.6817937Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:21:54.6851039Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:21:54.6882905Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:21:54.6932906Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:21:54.6982491Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:21:54.7030099Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:21:54.7065300Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:21:54.7105875Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:21:54.7145169Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:21:54.7184229Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:21:54.7225914Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:21:54.7274269Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:21:54.7313951Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:21:54.7356516Z Entering 'third_party/kleidiai' 2025-12-04T09:21:54.7414270Z Entering 'third_party/mimalloc' 2025-12-04T09:21:54.7463786Z Entering 'third_party/nlohmann' 2025-12-04T09:21:54.7527249Z Entering 'third_party/onnx' 2025-12-04T09:21:54.7585474Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:21:54.7645073Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:21:54.7690871Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:21:54.7732843Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:21:54.7776108Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:21:54.7814521Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:21:54.7866377Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:21:54.7908758Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:21:54.7946895Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:21:54.7992975Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:21:54.8039592Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:21:54.8082607Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:21:54.8136217Z Entering 'third_party/pocketfft' 2025-12-04T09:21:54.8185971Z Entering 'third_party/protobuf' 2025-12-04T09:21:54.8234055Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:21:54.8283061Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:21:54.8332481Z Entering 'third_party/psimd' 2025-12-04T09:21:54.8376606Z Entering 'third_party/pthreadpool' 2025-12-04T09:21:54.8417854Z Entering 'third_party/pybind11' 2025-12-04T09:21:54.8457229Z Entering 'third_party/python-peachpy' 2025-12-04T09:21:54.8497239Z Entering 'third_party/sleef' 2025-12-04T09:21:54.8548148Z Entering 'third_party/tensorpipe' 2025-12-04T09:21:54.8588799Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:21:54.8631349Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:21:54.8676009Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:21:54.8720051Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:21:54.8751485Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:21:54.8812610Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:54.8842208Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T09:21:54.9105912Z Entering 'android/libs/fbjni' 2025-12-04T09:21:54.9126456Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T09:21:54.9148649Z Entering 'third_party/FP16' 2025-12-04T09:21:54.9169704Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T09:21:54.9184476Z Entering 'third_party/FXdiv' 2025-12-04T09:21:54.9201590Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T09:21:54.9219907Z Entering 'third_party/NNPACK' 2025-12-04T09:21:54.9240296Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T09:21:54.9258479Z Entering 'third_party/NVTX' 2025-12-04T09:21:54.9275286Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T09:21:54.9297519Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:21:54.9320120Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T09:21:54.9340485Z Entering 'third_party/XNNPACK' 2025-12-04T09:21:54.9366561Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T09:21:54.9394886Z Entering 'third_party/aiter' 2025-12-04T09:21:54.9415381Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T09:21:54.9438724Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:21:54.9458975Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T09:21:54.9491194Z Entering 'third_party/benchmark' 2025-12-04T09:21:54.9507138Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T09:21:54.9530591Z Entering 'third_party/composable_kernel' 2025-12-04T09:21:54.9546327Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T09:21:54.9573926Z Entering 'third_party/cpp-httplib' 2025-12-04T09:21:54.9599038Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T09:21:54.9617643Z Entering 'third_party/cpuinfo' 2025-12-04T09:21:54.9638943Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T09:21:54.9658365Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:21:54.9688283Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T09:21:54.9708086Z Entering 'third_party/cutlass' 2025-12-04T09:21:54.9733682Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T09:21:54.9763451Z Entering 'third_party/fbgemm' 2025-12-04T09:21:54.9783372Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T09:21:54.9805542Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:21:54.9827615Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T09:21:54.9850317Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:21:54.9865902Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T09:21:54.9896424Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:21:54.9938716Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T09:21:54.9939322Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:21:54.9948915Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T09:21:54.9970463Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:21:54.9994202Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T09:21:55.0011326Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:21:55.0034379Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T09:21:55.0057453Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:21:55.0073890Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T09:21:55.0091985Z Entering 'third_party/flash-attention' 2025-12-04T09:21:55.0114243Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T09:21:55.0131269Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:21:55.0154661Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T09:21:55.0178090Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:21:55.0203397Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T09:21:55.0231775Z Entering 'third_party/flatbuffers' 2025-12-04T09:21:55.0250709Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T09:21:55.0271995Z Entering 'third_party/fmt' 2025-12-04T09:21:55.0292283Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T09:21:55.0310364Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:21:55.0328776Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T09:21:55.0349685Z Entering 'third_party/gloo' 2025-12-04T09:21:55.0374637Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T09:21:55.0395568Z Entering 'third_party/googletest' 2025-12-04T09:21:55.0411794Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:21:55.0430362Z Entering 'third_party/ideep' 2025-12-04T09:21:55.0450828Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T09:21:55.0469515Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:21:55.0494742Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T09:21:55.0529222Z Entering 'third_party/ittapi' 2025-12-04T09:21:55.0547763Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T09:21:55.0570224Z Entering 'third_party/kineto' 2025-12-04T09:21:55.0596213Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T09:21:55.0618191Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:21:55.0645576Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T09:21:55.0662449Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:21:55.0687787Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T09:21:55.0710306Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:21:55.0735358Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T09:21:55.0749934Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:21:55.0765911Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T09:21:55.0780827Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:21:55.0806360Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T09:21:55.0828611Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:21:55.0850182Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T09:21:55.0871812Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:21:55.0890038Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T09:21:55.0909305Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:21:55.0932757Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:21:55.0951356Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:21:55.0982381Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T09:21:55.1004305Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:21:55.1028156Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T09:21:55.1050443Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:21:55.1070243Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T09:21:55.1089786Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:21:55.1119559Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T09:21:55.1136915Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:21:55.1157902Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T09:21:55.1189774Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:21:55.1202391Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T09:21:55.1221270Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:21:55.1241591Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T09:21:55.1267546Z Entering 'third_party/kleidiai' 2025-12-04T09:21:55.1285661Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T09:21:55.1306264Z Entering 'third_party/mimalloc' 2025-12-04T09:21:55.1324333Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T09:21:55.1344674Z Entering 'third_party/nlohmann' 2025-12-04T09:21:55.1370187Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T09:21:55.1392346Z Entering 'third_party/onnx' 2025-12-04T09:21:55.1412906Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T09:21:55.1444103Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:21:55.1468149Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T09:21:55.1497300Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:21:55.1515106Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T09:21:55.1534162Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:21:55.1559137Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T09:21:55.1575500Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:21:55.1593212Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:21:55.1609905Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:21:55.1629677Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T09:21:55.1651838Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:21:55.1668489Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T09:21:55.1685498Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:21:55.1708185Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T09:21:55.1721181Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:21:55.1741475Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T09:21:55.1761251Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:21:55.1779363Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T09:21:55.1800663Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:21:55.1822168Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T09:21:55.1844080Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:21:55.1870985Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T09:21:55.1893721Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:21:55.1918020Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T09:21:55.1950560Z Entering 'third_party/pocketfft' 2025-12-04T09:21:55.1976824Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T09:21:55.1997853Z Entering 'third_party/protobuf' 2025-12-04T09:21:55.2012746Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T09:21:55.2035741Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:21:55.2052032Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T09:21:55.2075141Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:21:55.2091499Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:21:55.2110598Z Entering 'third_party/psimd' 2025-12-04T09:21:55.2129614Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T09:21:55.2146487Z Entering 'third_party/pthreadpool' 2025-12-04T09:21:55.2162698Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T09:21:55.2177974Z Entering 'third_party/pybind11' 2025-12-04T09:21:55.2198179Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T09:21:55.2214449Z Entering 'third_party/python-peachpy' 2025-12-04T09:21:55.2234817Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T09:21:55.2252750Z Entering 'third_party/sleef' 2025-12-04T09:21:55.2277975Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T09:21:55.2295124Z Entering 'third_party/tensorpipe' 2025-12-04T09:21:55.2319677Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T09:21:55.2336223Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:21:55.2358938Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:21:55.2375722Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:21:55.2400290Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T09:21:55.2415399Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:21:55.2431573Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T09:21:55.2451052Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:21:55.2467585Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T09:21:55.2483442Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:21:55.2503702Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T09:21:55.2550801Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2581298Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2610484Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2640359Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2665744Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2685475Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2717949Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2736929Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2760803Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2785232Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2804541Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2823530Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2843207Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2862580Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2890064Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2914801Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2940616Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2966630Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.2991031Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3019261Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3041504Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3060671Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3079210Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3097645Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3115547Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3135187Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3154244Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3174078Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3193420Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3212866Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3237410Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3260168Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3285087Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3309343Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3328690Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3351361Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3377835Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3396256Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3419264Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3442313Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3466891Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3490609Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3511980Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3540399Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3561998Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3590376Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3612772Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3633857Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3657431Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3680597Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3699883Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3721163Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3739951Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3761699Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3787141Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3809930Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3831325Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3852296Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3876515Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3899034Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3921806Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3943382Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3971974Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.3993650Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4021088Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4042810Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4067737Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4092235Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4118628Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4142129Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4167467Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4186916Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4206359Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4234119Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4256720Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4280213Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4299854Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4323224Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4348792Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4371895Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4393179Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:21:55.4416664Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T09:21:55.4450878Z ##[endgroup] 2025-12-04T09:21:55.4451133Z ##[group]Fetching the repository 2025-12-04T09:21:55.4456004Z [command]/usr/bin/git -c protocol.version=2 fetch --prune --no-recurse-submodules origin +refs/heads/*:refs/remotes/origin/* +refs/tags/*:refs/tags/* 2025-12-04T09:21:56.0005347Z From https://github.com/pytorch/pytorch 2025-12-04T09:21:56.0006027Z - [deleted] (none) -> origin/IvanKobzarev/stack/2 2025-12-04T09:21:56.0543392Z - [deleted] (none) -> origin/add_conv3d_ops 2025-12-04T09:21:56.0544174Z - [deleted] (none) -> origin/bf/partition-custom-rules 2025-12-04T09:21:56.0545083Z - [deleted] (none) -> origin/codex/create-test-for-tensor-memory-leak-in-cudagraph 2025-12-04T09:21:56.0545870Z - [deleted] (none) -> origin/export-D58091437 2025-12-04T09:21:56.0546482Z - [deleted] (none) -> origin/export-D84025878 2025-12-04T09:21:56.0547953Z - [deleted] (none) -> origin/export-D84169910 2025-12-04T09:21:56.0549277Z - [deleted] (none) -> origin/gh/IvanKobzarev/158/base 2025-12-04T09:21:56.0550510Z - [deleted] (none) -> origin/gh/IvanKobzarev/158/head 2025-12-04T09:21:56.0551848Z - [deleted] (none) -> origin/gh/IvanKobzarev/158/orig 2025-12-04T09:21:56.0552971Z - [deleted] (none) -> origin/gh/IvanKobzarev/164/base 2025-12-04T09:21:56.0553777Z - [deleted] (none) -> origin/gh/IvanKobzarev/164/head 2025-12-04T09:21:56.0554745Z - [deleted] (none) -> origin/gh/IvanKobzarev/164/orig 2025-12-04T09:21:56.0555852Z - [deleted] (none) -> origin/gh/PaulZhang12/41/base 2025-12-04T09:21:56.0556947Z - [deleted] (none) -> origin/gh/PaulZhang12/41/head 2025-12-04T09:21:56.0558188Z - [deleted] (none) -> origin/gh/PaulZhang12/41/orig 2025-12-04T09:21:56.0559182Z - [deleted] (none) -> origin/gh/XuehaiPan/395/base 2025-12-04T09:21:56.0560380Z - [deleted] (none) -> origin/gh/XuehaiPan/395/head 2025-12-04T09:21:56.0561495Z - [deleted] (none) -> origin/gh/XuehaiPan/395/orig 2025-12-04T09:21:56.0562512Z - [deleted] (none) -> origin/gh/XuehaiPan/396/base 2025-12-04T09:21:56.0563693Z - [deleted] (none) -> origin/gh/XuehaiPan/396/orig 2025-12-04T09:21:56.0564800Z - [deleted] (none) -> origin/gh/andyanwang/43/base 2025-12-04T09:21:56.0565908Z - [deleted] (none) -> origin/gh/andyanwang/43/head 2025-12-04T09:21:56.0571637Z - [deleted] (none) -> origin/gh/andyanwang/43/orig 2025-12-04T09:21:56.0572363Z - [deleted] (none) -> origin/gh/andyanwang/44/base 2025-12-04T09:21:56.0573362Z - [deleted] (none) -> origin/gh/andyanwang/44/head 2025-12-04T09:21:56.0574207Z - [deleted] (none) -> origin/gh/andyanwang/44/orig 2025-12-04T09:21:56.0575376Z - [deleted] (none) -> origin/gh/angelayi/130/base 2025-12-04T09:21:56.0577075Z - [deleted] (none) -> origin/gh/angelayi/130/head 2025-12-04T09:21:56.0577687Z - [deleted] (none) -> origin/gh/angelayi/130/orig 2025-12-04T09:21:56.0578633Z - [deleted] (none) -> origin/gh/anshul-si/61/base 2025-12-04T09:21:56.0579846Z - [deleted] (none) -> origin/gh/anshul-si/61/head 2025-12-04T09:21:56.0580936Z - [deleted] (none) -> origin/gh/anshul-si/61/orig 2025-12-04T09:21:56.0582010Z - [deleted] (none) -> origin/gh/anshul-si/62/base 2025-12-04T09:21:56.0583198Z - [deleted] (none) -> origin/gh/anshul-si/62/head 2025-12-04T09:21:56.0584278Z - [deleted] (none) -> origin/gh/anshul-si/62/orig 2025-12-04T09:21:56.0585494Z - [deleted] (none) -> origin/gh/anshul-si/63/base 2025-12-04T09:21:56.0586593Z - [deleted] (none) -> origin/gh/anshul-si/63/head 2025-12-04T09:21:56.0587683Z - [deleted] (none) -> origin/gh/anshul-si/63/orig 2025-12-04T09:21:56.0588865Z - [deleted] (none) -> origin/gh/ezyang/3177/base 2025-12-04T09:21:56.0589947Z - [deleted] (none) -> origin/gh/ezyang/3177/head 2025-12-04T09:21:56.0591039Z - [deleted] (none) -> origin/gh/ezyang/3177/orig 2025-12-04T09:21:56.0592209Z - [deleted] (none) -> origin/gh/ezyang/3184/base 2025-12-04T09:21:56.0593299Z - [deleted] (none) -> origin/gh/ezyang/3184/head 2025-12-04T09:21:56.0594508Z - [deleted] (none) -> origin/gh/ezyang/3184/orig 2025-12-04T09:21:56.0595614Z - [deleted] (none) -> origin/gh/ezyang/3187/base 2025-12-04T09:21:56.0596746Z - [deleted] (none) -> origin/gh/ezyang/3187/head 2025-12-04T09:21:56.0597870Z - [deleted] (none) -> origin/gh/ezyang/3187/orig 2025-12-04T09:21:56.0598957Z - [deleted] (none) -> origin/gh/ezyang/3188/base 2025-12-04T09:21:56.0600115Z - [deleted] (none) -> origin/gh/ezyang/3188/head 2025-12-04T09:21:56.0601182Z - [deleted] (none) -> origin/gh/ezyang/3188/orig 2025-12-04T09:21:56.0602347Z - [deleted] (none) -> origin/gh/ezyang/3190/base 2025-12-04T09:21:56.0603524Z - [deleted] (none) -> origin/gh/ezyang/3190/head 2025-12-04T09:21:56.0604613Z - [deleted] (none) -> origin/gh/ezyang/3190/orig 2025-12-04T09:21:56.0605773Z - [deleted] (none) -> origin/gh/fxdawnn/8/base 2025-12-04T09:21:56.0606981Z - [deleted] (none) -> origin/gh/fxdawnn/8/head 2025-12-04T09:21:56.0608103Z - [deleted] (none) -> origin/gh/fxdawnn/8/orig 2025-12-04T09:21:56.0609285Z - [deleted] (none) -> origin/gh/guilhermeleobas/239/base 2025-12-04T09:21:56.0610361Z - [deleted] (none) -> origin/gh/guilhermeleobas/239/head 2025-12-04T09:21:56.0611451Z - [deleted] (none) -> origin/gh/guilhermeleobas/239/orig 2025-12-04T09:21:56.0612619Z - [deleted] (none) -> origin/gh/malfet/572/base 2025-12-04T09:21:56.0613712Z - [deleted] (none) -> origin/gh/malfet/572/head 2025-12-04T09:21:56.0614903Z - [deleted] (none) -> origin/gh/malfet/572/orig 2025-12-04T09:21:56.0616001Z - [deleted] (none) -> origin/gh/malfet/582/base 2025-12-04T09:21:56.0617121Z - [deleted] (none) -> origin/gh/malfet/582/head 2025-12-04T09:21:56.0618221Z - [deleted] (none) -> origin/gh/malfet/582/orig 2025-12-04T09:21:56.0619344Z - [deleted] (none) -> origin/gh/mikaylagawarecki/337/base 2025-12-04T09:21:56.0620554Z - [deleted] (none) -> origin/gh/mikaylagawarecki/337/head 2025-12-04T09:21:56.0621638Z - [deleted] (none) -> origin/gh/mikaylagawarecki/337/orig 2025-12-04T09:21:56.0623228Z - [deleted] (none) -> origin/gh/pianpwk/27/base 2025-12-04T09:21:56.0623864Z - [deleted] (none) -> origin/gh/pianpwk/27/head 2025-12-04T09:21:56.0624969Z - [deleted] (none) -> origin/gh/pianpwk/27/orig 2025-12-04T09:21:56.0626114Z - [deleted] (none) -> origin/gh/shunting314/238/base 2025-12-04T09:21:56.0627966Z - [deleted] (none) -> origin/gh/shunting314/238/head 2025-12-04T09:21:56.0628568Z - [deleted] (none) -> origin/gh/shunting314/238/orig 2025-12-04T09:21:56.0629482Z - [deleted] (none) -> origin/gh/wconstab/438/base 2025-12-04T09:21:56.0630587Z - [deleted] (none) -> origin/gh/wconstab/438/head 2025-12-04T09:21:56.0631755Z - [deleted] (none) -> origin/gh/wconstab/438/orig 2025-12-04T09:21:56.0632880Z - [deleted] (none) -> origin/gh/williamwen42/317/base 2025-12-04T09:21:56.0634125Z - [deleted] (none) -> origin/gh/williamwen42/317/head 2025-12-04T09:21:56.0635155Z - [deleted] (none) -> origin/gh/williamwen42/317/orig 2025-12-04T09:21:56.0636239Z - [deleted] (none) -> origin/gh/williamwen42/318/base 2025-12-04T09:21:56.0637447Z - [deleted] (none) -> origin/gh/williamwen42/318/head 2025-12-04T09:21:56.0638560Z - [deleted] (none) -> origin/gh/williamwen42/318/orig 2025-12-04T09:21:56.0639706Z - [deleted] (none) -> origin/gh/williamwen42/322/base 2025-12-04T09:21:56.0640782Z - [deleted] (none) -> origin/gh/williamwen42/322/head 2025-12-04T09:21:56.0641875Z - [deleted] (none) -> origin/gh/williamwen42/322/orig 2025-12-04T09:21:56.0643059Z - [deleted] (none) -> origin/gh/zhxchen17/36/base 2025-12-04T09:21:56.0644200Z - [deleted] (none) -> origin/gh/zhxchen17/36/head 2025-12-04T09:21:56.0645327Z - [deleted] (none) -> origin/gh/zhxchen17/36/orig 2025-12-04T09:21:56.0646472Z - [deleted] (none) -> origin/gh/zhxchen17/37/base 2025-12-04T09:21:56.0647636Z - [deleted] (none) -> origin/gh/zhxchen17/37/head 2025-12-04T09:21:56.0648746Z - [deleted] (none) -> origin/gh/zhxchen17/37/orig 2025-12-04T09:21:56.0649903Z - [deleted] (none) -> origin/gh/zhxchen17/38/base 2025-12-04T09:21:56.0650989Z - [deleted] (none) -> origin/gh/zhxchen17/38/head 2025-12-04T09:21:56.0652068Z - [deleted] (none) -> origin/gh/zhxchen17/38/orig 2025-12-04T09:21:56.0653350Z - [deleted] (none) -> origin/lucaskabela/typinng_variables_builtin 2025-12-04T09:21:56.0654456Z - [deleted] (none) -> origin/nmacchioni-40p-machine-balance 2025-12-04T09:21:56.0655543Z - [deleted] (none) -> origin/nmacchioni-60p-machine-balance 2025-12-04T09:21:56.0656792Z - [deleted] (none) -> origin/nmacchioni-padmm-no-compute-bound-check 2025-12-04T09:21:56.0657778Z - [deleted] (none) -> origin/revert-cpp 2025-12-04T09:21:56.0658940Z - [deleted] (none) -> origin/try-removing-ifndef-in-shim_common 2025-12-04T09:21:56.0660038Z - [deleted] (none) -> origin/update-xla-commit-hash/19027297646-210-1 2025-12-04T09:21:56.0661145Z - [deleted] (none) -> origin/zxiiro/c7i-docs-build 2025-12-04T09:21:56.0662306Z - [deleted] (none) -> origin/zxiiro/c7i-pull 2025-12-04T09:21:56.0663399Z - [deleted] (none) -> ciflow/binaries/169307 2025-12-04T09:21:56.0664499Z - [deleted] (none) -> ciflow/binaries/169444 2025-12-04T09:21:56.0665593Z - [deleted] (none) -> ciflow/dynamo/168131 2025-12-04T09:21:56.0666773Z - [deleted] (none) -> ciflow/dynamo/168203 2025-12-04T09:21:56.0668181Z - [deleted] (none) -> ciflow/dynamo/168886 2025-12-04T09:21:56.0669294Z - [deleted] (none) -> ciflow/inductor-perf-compare/165029 2025-12-04T09:21:56.0670244Z - [deleted] (none) -> ciflow/inductor-perf-test-nightly-rocm-mi300/162052 2025-12-04T09:21:56.0671381Z - [deleted] (none) -> ciflow/inductor-perf-test-nightly-x86-zen/164126 2025-12-04T09:21:56.0672445Z - [deleted] (none) -> ciflow/inductor-perf-test-nightly/165029 2025-12-04T09:21:56.0673565Z - [deleted] (none) -> ciflow/inductor-periodic/165029 2025-12-04T09:21:56.0674683Z - [deleted] (none) -> ciflow/inductor-rocm/159146 2025-12-04T09:21:56.0676012Z - [deleted] (none) -> ciflow/inductor/159146 2025-12-04T09:21:56.0677109Z - [deleted] (none) -> ciflow/inductor/160174 2025-12-04T09:21:56.0678047Z - [deleted] (none) -> ciflow/inductor/162666 2025-12-04T09:21:56.0679233Z - [deleted] (none) -> ciflow/inductor/163774 2025-12-04T09:21:56.0680290Z - [deleted] (none) -> ciflow/inductor/165029 2025-12-04T09:21:56.0681439Z - [deleted] (none) -> ciflow/inductor/165643 2025-12-04T09:21:56.0682508Z - [deleted] (none) -> ciflow/inductor/167294 2025-12-04T09:21:56.0683644Z - [deleted] (none) -> ciflow/inductor/167364 2025-12-04T09:21:56.0684781Z - [deleted] (none) -> ciflow/inductor/167542 2025-12-04T09:21:56.0685896Z - [deleted] (none) -> ciflow/inductor/167741 2025-12-04T09:21:56.0691765Z - [deleted] (none) -> ciflow/inductor/168082 2025-12-04T09:21:56.0692443Z - [deleted] (none) -> ciflow/inductor/168131 2025-12-04T09:21:56.0693268Z - [deleted] (none) -> ciflow/inductor/168146 2025-12-04T09:21:56.0694460Z - [deleted] (none) -> ciflow/inductor/168203 2025-12-04T09:21:56.0695490Z - [deleted] (none) -> ciflow/inductor/168386 2025-12-04T09:21:56.0696664Z - [deleted] (none) -> ciflow/inductor/168886 2025-12-04T09:21:56.0697743Z - [deleted] (none) -> ciflow/inductor/169070 2025-12-04T09:21:56.0699004Z - [deleted] (none) -> ciflow/inductor/169114 2025-12-04T09:21:56.0700101Z - [deleted] (none) -> ciflow/inductor/169199 2025-12-04T09:21:56.0701245Z - [deleted] (none) -> ciflow/inductor/169225 2025-12-04T09:21:56.0702475Z - [deleted] (none) -> ciflow/inductor/169306 2025-12-04T09:21:56.0703538Z - [deleted] (none) -> ciflow/inductor/169347 2025-12-04T09:21:56.0704662Z - [deleted] (none) -> ciflow/inductor/169351 2025-12-04T09:21:56.0705877Z - [deleted] (none) -> ciflow/inductor/169369 2025-12-04T09:21:56.0706879Z - [deleted] (none) -> ciflow/inductor/169406 2025-12-04T09:21:56.0708055Z - [deleted] (none) -> ciflow/inductor/169421 2025-12-04T09:21:56.0709166Z - [deleted] (none) -> ciflow/inductor/169422 2025-12-04T09:21:56.0710278Z - [deleted] (none) -> ciflow/inductor/169426 2025-12-04T09:21:56.0711468Z - [deleted] (none) -> ciflow/inductor/169449 2025-12-04T09:21:56.0712561Z - [deleted] (none) -> ciflow/linux-aarch64/165643 2025-12-04T09:21:56.0713707Z - [deleted] (none) -> ciflow/linux-aarch64/167069 2025-12-04T09:21:56.0714778Z - [deleted] (none) -> ciflow/mps/167569 2025-12-04T09:21:56.0715840Z - [deleted] (none) -> ciflow/mps/168120 2025-12-04T09:21:56.0716998Z - [deleted] (none) -> ciflow/mps/168332 2025-12-04T09:21:56.0718197Z - [deleted] (none) -> ciflow/mps/169407 2025-12-04T09:21:56.0719546Z - [deleted] (none) -> ciflow/op-benchmark/168216 2025-12-04T09:21:56.0720432Z - [deleted] (none) -> ciflow/rocm-mi300/159146 2025-12-04T09:21:56.0721491Z - [deleted] (none) -> ciflow/rocm/159146 2025-12-04T09:21:56.0722869Z - [deleted] (none) -> ciflow/rocm/169300 2025-12-04T09:21:56.0723824Z - [deleted] (none) -> ciflow/rocm/169369 2025-12-04T09:21:56.0724900Z - [deleted] (none) -> ciflow/trunk/162706 2025-12-04T09:21:56.0726060Z - [deleted] (none) -> ciflow/trunk/163774 2025-12-04T09:21:56.0727265Z - [deleted] (none) -> ciflow/trunk/165029 2025-12-04T09:21:56.0728368Z - [deleted] (none) -> ciflow/trunk/167364 2025-12-04T09:21:56.0729522Z - [deleted] (none) -> ciflow/trunk/167397 2025-12-04T09:21:56.0730621Z - [deleted] (none) -> ciflow/trunk/167569 2025-12-04T09:21:56.0731705Z - [deleted] (none) -> ciflow/trunk/167828 2025-12-04T09:21:56.0732896Z - [deleted] (none) -> ciflow/trunk/168082 2025-12-04T09:21:56.0733932Z - [deleted] (none) -> ciflow/trunk/168146 2025-12-04T09:21:56.0735139Z - [deleted] (none) -> ciflow/trunk/168382 2025-12-04T09:21:56.0736218Z - [deleted] (none) -> ciflow/trunk/168389 2025-12-04T09:21:56.0737385Z - [deleted] (none) -> ciflow/trunk/168899 2025-12-04T09:21:56.0738507Z - [deleted] (none) -> ciflow/trunk/168966 2025-12-04T09:21:56.0739598Z - [deleted] (none) -> ciflow/trunk/169043 2025-12-04T09:21:56.0740675Z - [deleted] (none) -> ciflow/trunk/169070 2025-12-04T09:21:56.0741886Z - [deleted] (none) -> ciflow/trunk/169086 2025-12-04T09:21:56.0742965Z - [deleted] (none) -> ciflow/trunk/169199 2025-12-04T09:21:56.0744154Z - [deleted] (none) -> ciflow/trunk/169225 2025-12-04T09:21:56.0745220Z - [deleted] (none) -> ciflow/trunk/169229 2025-12-04T09:21:56.0746375Z - [deleted] (none) -> ciflow/trunk/169307 2025-12-04T09:21:56.0747505Z - [deleted] (none) -> ciflow/trunk/169322 2025-12-04T09:21:56.0748597Z - [deleted] (none) -> ciflow/trunk/169369 2025-12-04T09:21:56.0749760Z - [deleted] (none) -> ciflow/trunk/169396 2025-12-04T09:21:56.0750850Z - [deleted] (none) -> ciflow/trunk/169406 2025-12-04T09:21:56.0751999Z - [deleted] (none) -> ciflow/trunk/169421 2025-12-04T09:21:56.0753102Z - [deleted] (none) -> ciflow/trunk/169422 2025-12-04T09:21:56.0754279Z - [deleted] (none) -> ciflow/trunk/169433 2025-12-04T09:21:56.0755391Z - [deleted] (none) -> ciflow/xpu/165766 2025-12-04T09:21:56.0756533Z - [deleted] (none) -> ciflow/xpu/167057 2025-12-04T09:21:56.0757630Z - [deleted] (none) -> ciflow/xpu/168952 2025-12-04T09:21:56.0758738Z - [deleted] (none) -> ciflow/xpu/168966 2025-12-04T09:21:56.0759816Z - [deleted] (none) -> ciflow/xpu/169043 2025-12-04T09:21:56.0761029Z - [deleted] (none) -> ciflow/xpu/169229 2025-12-04T09:21:56.0762124Z - [deleted] (none) -> ciflow/xpu/169257 2025-12-04T09:21:56.0763196Z - [deleted] (none) -> ciflow/xpu/169322 2025-12-04T09:21:56.0764350Z - [deleted] (none) -> ciflow/xpu/169443 2025-12-04T09:21:59.8982674Z * [new branch] activation_bench -> origin/activation_bench 2025-12-04T09:21:59.8994118Z + d55094b9b55...786e731fdbe angelayi/aot_compile -> origin/angelayi/aot_compile (forced update) 2025-12-04T09:21:59.8997066Z + 931c8800f30...0054b7e7d40 angelayi/side_eff -> origin/angelayi/side_eff (forced update) 2025-12-04T09:21:59.8998109Z * [new branch] aoti_const_device -> origin/aoti_const_device 2025-12-04T09:21:59.9002208Z * [new branch] crpa/typo-in-inductor_comm_lowering -> origin/crpa/typo-in-inductor_comm_lowering 2025-12-04T09:21:59.9004195Z + 6c0b427a3f7...7b852a8c5fd eqy-patch-6 -> origin/eqy-patch-6 (forced update) 2025-12-04T09:21:59.9011386Z + b2498397896...6998629c573 export-D87838959 -> origin/export-D87838959 (forced update) 2025-12-04T09:21:59.9014166Z * [new branch] export-D88319437 -> origin/export-D88319437 2025-12-04T09:21:59.9024941Z 04d015c633c..cc0aa98554e gh/IvanKobzarev/174/base -> origin/gh/IvanKobzarev/174/base 2025-12-04T09:21:59.9029849Z 732701a870d..4e0ea89e9d7 gh/IvanKobzarev/174/head -> origin/gh/IvanKobzarev/174/head 2025-12-04T09:21:59.9032984Z + 86e264480e6...1cef815cbcf gh/IvanKobzarev/174/orig -> origin/gh/IvanKobzarev/174/orig (forced update) 2025-12-04T09:21:59.9037367Z df9a7400a4d..bc621aeb8e1 gh/IvanKobzarev/180/base -> origin/gh/IvanKobzarev/180/base 2025-12-04T09:21:59.9039549Z 52c06b865b6..ce375ac5318 gh/IvanKobzarev/180/head -> origin/gh/IvanKobzarev/180/head 2025-12-04T09:21:59.9043550Z + 1b071e6db22...a950a18e526 gh/IvanKobzarev/180/orig -> origin/gh/IvanKobzarev/180/orig (forced update) 2025-12-04T09:21:59.9049621Z 9648440cbce..d2a73037748 gh/SherlockNoMad/18/base -> origin/gh/SherlockNoMad/18/base 2025-12-04T09:21:59.9053373Z 590bdcd8b32..059d973f63b gh/SherlockNoMad/18/head -> origin/gh/SherlockNoMad/18/head 2025-12-04T09:21:59.9056282Z + 4fc4df167ae...aa0dfe0fd70 gh/SherlockNoMad/18/orig -> origin/gh/SherlockNoMad/18/orig (forced update) 2025-12-04T09:21:59.9059978Z * [new branch] gh/SherlockNoMad/20/base -> origin/gh/SherlockNoMad/20/base 2025-12-04T09:21:59.9062586Z * [new branch] gh/SherlockNoMad/20/head -> origin/gh/SherlockNoMad/20/head 2025-12-04T09:21:59.9065069Z * [new branch] gh/SherlockNoMad/20/orig -> origin/gh/SherlockNoMad/20/orig 2025-12-04T09:21:59.9068474Z * [new branch] gh/SherlockNoMad/21/base -> origin/gh/SherlockNoMad/21/base 2025-12-04T09:21:59.9071060Z * [new branch] gh/SherlockNoMad/21/head -> origin/gh/SherlockNoMad/21/head 2025-12-04T09:21:59.9073476Z * [new branch] gh/SherlockNoMad/21/orig -> origin/gh/SherlockNoMad/21/orig 2025-12-04T09:21:59.9079030Z 6785bdd3241..14bc7175e16 gh/XuehaiPan/400/base -> origin/gh/XuehaiPan/400/base 2025-12-04T09:21:59.9082383Z 5f543d3f682..dd45df28d65 gh/XuehaiPan/400/head -> origin/gh/XuehaiPan/400/head 2025-12-04T09:21:59.9085819Z + af7a66f0ae4...13309b8b80b gh/XuehaiPan/400/orig -> origin/gh/XuehaiPan/400/orig (forced update) 2025-12-04T09:21:59.9089764Z 72dd8d59092..07a53dcd83b gh/alexsamardzic/14/base -> origin/gh/alexsamardzic/14/base 2025-12-04T09:21:59.9092977Z 8d40128ca94..c29a8268fcd gh/alexsamardzic/14/head -> origin/gh/alexsamardzic/14/head 2025-12-04T09:21:59.9096213Z + 63f4bcc49fd...6aaf1495e55 gh/alexsamardzic/14/orig -> origin/gh/alexsamardzic/14/orig (forced update) 2025-12-04T09:21:59.9100859Z 5a7b00f8fe4..84fc355f5fc gh/angelayi/135/base -> origin/gh/angelayi/135/base 2025-12-04T09:21:59.9104126Z 9a263d9c05a..5df883fd2f0 gh/angelayi/135/head -> origin/gh/angelayi/135/head 2025-12-04T09:21:59.9107344Z + f343d2d8f2c...cbd019ea115 gh/angelayi/135/orig -> origin/gh/angelayi/135/orig (forced update) 2025-12-04T09:21:59.9110546Z fef61178394..585e20f0bd2 gh/angelayi/136/base -> origin/gh/angelayi/136/base 2025-12-04T09:21:59.9113761Z 3a0c47b7fa3..eb99f7436e5 gh/angelayi/136/head -> origin/gh/angelayi/136/head 2025-12-04T09:21:59.9116953Z + 1d9d49f5cc1...c6b9eda9eae gh/angelayi/136/orig -> origin/gh/angelayi/136/orig (forced update) 2025-12-04T09:21:59.9120463Z a08c63b521a..ea7b6101f8d gh/angelayi/137/base -> origin/gh/angelayi/137/base 2025-12-04T09:21:59.9123244Z 89faa301821..298c3617dc4 gh/angelayi/137/head -> origin/gh/angelayi/137/head 2025-12-04T09:21:59.9126791Z + b2f106da0af...30217d4fb29 gh/angelayi/137/orig -> origin/gh/angelayi/137/orig (forced update) 2025-12-04T09:21:59.9129849Z c1e935655ef..341a8a86d78 gh/angelayi/138/base -> origin/gh/angelayi/138/base 2025-12-04T09:21:59.9132982Z dcef61954a8..8f803c22f4c gh/angelayi/138/head -> origin/gh/angelayi/138/head 2025-12-04T09:21:59.9136187Z + aa556b09760...1fb2eb5bc9b gh/angelayi/138/orig -> origin/gh/angelayi/138/orig (forced update) 2025-12-04T09:21:59.9139335Z e424d9b7aff..89c41e2bdad gh/angelayi/139/base -> origin/gh/angelayi/139/base 2025-12-04T09:21:59.9142663Z cf9d6f4b137..365956052a8 gh/angelayi/139/head -> origin/gh/angelayi/139/head 2025-12-04T09:21:59.9145956Z + bfb8d6abc57...f1b136064fd gh/angelayi/139/orig -> origin/gh/angelayi/139/orig (forced update) 2025-12-04T09:21:59.9149806Z 19b29c7082c..87e3097991a gh/angelayi/140/base -> origin/gh/angelayi/140/base 2025-12-04T09:21:59.9153398Z 6e7640b6a12..4aba7c07434 gh/angelayi/140/head -> origin/gh/angelayi/140/head 2025-12-04T09:21:59.9157427Z + 6562d89b7c8...fec59ee2b87 gh/angelayi/140/orig -> origin/gh/angelayi/140/orig (forced update) 2025-12-04T09:21:59.9160962Z b82d7c85dc9..809bb28718e gh/angelayi/144/base -> origin/gh/angelayi/144/base 2025-12-04T09:21:59.9164366Z 1dd859e3b12..ee83769a925 gh/angelayi/144/head -> origin/gh/angelayi/144/head 2025-12-04T09:21:59.9168317Z + 956223a2673...6cd5696ab40 gh/angelayi/144/orig -> origin/gh/angelayi/144/orig (forced update) 2025-12-04T09:21:59.9174403Z * [new branch] gh/anijain2305/969/base -> origin/gh/anijain2305/969/base 2025-12-04T09:21:59.9177074Z * [new branch] gh/anijain2305/969/head -> origin/gh/anijain2305/969/head 2025-12-04T09:21:59.9179752Z * [new branch] gh/anijain2305/969/orig -> origin/gh/anijain2305/969/orig 2025-12-04T09:21:59.9183485Z * [new branch] gh/anijain2305/970/base -> origin/gh/anijain2305/970/base 2025-12-04T09:21:59.9186213Z * [new branch] gh/anijain2305/970/head -> origin/gh/anijain2305/970/head 2025-12-04T09:21:59.9188809Z * [new branch] gh/anijain2305/970/orig -> origin/gh/anijain2305/970/orig 2025-12-04T09:21:59.9194148Z bfa76f49aff..6d2d0859cb3 gh/aorenste/146/base -> origin/gh/aorenste/146/base 2025-12-04T09:21:59.9197895Z 47de5f4a428..068e7165ee6 gh/aorenste/146/head -> origin/gh/aorenste/146/head 2025-12-04T09:21:59.9201391Z + 0aafd61dc3f...6bfffbe9edc gh/aorenste/146/orig -> origin/gh/aorenste/146/orig (forced update) 2025-12-04T09:21:59.9205351Z 288eb932c32..38c33ffe1b9 gh/aorenste/147/base -> origin/gh/aorenste/147/base 2025-12-04T09:21:59.9209325Z 13cf2f9bff2..7202b500fdb gh/aorenste/147/head -> origin/gh/aorenste/147/head 2025-12-04T09:21:59.9212783Z + c412ad392ed...359c4eeb3ba gh/aorenste/147/orig -> origin/gh/aorenste/147/orig (forced update) 2025-12-04T09:21:59.9216596Z * [new branch] gh/aorenste/153/base -> origin/gh/aorenste/153/base 2025-12-04T09:21:59.9219234Z * [new branch] gh/aorenste/153/head -> origin/gh/aorenste/153/head 2025-12-04T09:21:59.9221593Z * [new branch] gh/aorenste/153/orig -> origin/gh/aorenste/153/orig 2025-12-04T09:21:59.9224990Z * [new branch] gh/aorenste/154/base -> origin/gh/aorenste/154/base 2025-12-04T09:21:59.9227567Z * [new branch] gh/aorenste/154/head -> origin/gh/aorenste/154/head 2025-12-04T09:21:59.9229987Z * [new branch] gh/aorenste/154/orig -> origin/gh/aorenste/154/orig 2025-12-04T09:21:59.9233382Z * [new branch] gh/aorenste/155/base -> origin/gh/aorenste/155/base 2025-12-04T09:21:59.9236326Z * [new branch] gh/aorenste/155/head -> origin/gh/aorenste/155/head 2025-12-04T09:21:59.9238457Z * [new branch] gh/aorenste/155/orig -> origin/gh/aorenste/155/orig 2025-12-04T09:21:59.9241882Z * [new branch] gh/aorenste/156/base -> origin/gh/aorenste/156/base 2025-12-04T09:21:59.9244442Z * [new branch] gh/aorenste/156/head -> origin/gh/aorenste/156/head 2025-12-04T09:21:59.9246965Z * [new branch] gh/aorenste/156/orig -> origin/gh/aorenste/156/orig 2025-12-04T09:21:59.9250602Z * [new branch] gh/aorenste/157/base -> origin/gh/aorenste/157/base 2025-12-04T09:21:59.9253198Z * [new branch] gh/aorenste/157/head -> origin/gh/aorenste/157/head 2025-12-04T09:21:59.9255547Z * [new branch] gh/aorenste/157/orig -> origin/gh/aorenste/157/orig 2025-12-04T09:21:59.9258001Z * [new branch] gh/aorenste/158/base -> origin/gh/aorenste/158/base 2025-12-04T09:21:59.9259741Z * [new branch] gh/aorenste/158/head -> origin/gh/aorenste/158/head 2025-12-04T09:21:59.9261494Z * [new branch] gh/aorenste/158/orig -> origin/gh/aorenste/158/orig 2025-12-04T09:21:59.9264082Z * [new branch] gh/aorenste/159/base -> origin/gh/aorenste/159/base 2025-12-04T09:21:59.9265798Z * [new branch] gh/aorenste/159/head -> origin/gh/aorenste/159/head 2025-12-04T09:21:59.9267535Z * [new branch] gh/aorenste/159/orig -> origin/gh/aorenste/159/orig 2025-12-04T09:21:59.9274248Z 93fef4bd1dd..22ff87511cc gh/desertfire/614/base -> origin/gh/desertfire/614/base 2025-12-04T09:21:59.9276759Z b6be118ed49..6c7a2724f55 gh/desertfire/614/head -> origin/gh/desertfire/614/head 2025-12-04T09:21:59.9279259Z + 003177df662...50a62cb0d97 gh/desertfire/614/orig -> origin/gh/desertfire/614/orig (forced update) 2025-12-04T09:21:59.9281815Z * [new branch] gh/desertfire/615/base -> origin/gh/desertfire/615/base 2025-12-04T09:21:59.9284016Z * [new branch] gh/desertfire/615/head -> origin/gh/desertfire/615/head 2025-12-04T09:21:59.9285749Z * [new branch] gh/desertfire/615/orig -> origin/gh/desertfire/615/orig 2025-12-04T09:21:59.9288460Z * [new branch] gh/desertfire/616/base -> origin/gh/desertfire/616/base 2025-12-04T09:21:59.9290342Z * [new branch] gh/desertfire/616/head -> origin/gh/desertfire/616/head 2025-12-04T09:21:59.9292116Z * [new branch] gh/desertfire/616/orig -> origin/gh/desertfire/616/orig 2025-12-04T09:21:59.9294739Z * [new branch] gh/desertfire/617/base -> origin/gh/desertfire/617/base 2025-12-04T09:21:59.9296564Z * [new branch] gh/desertfire/617/head -> origin/gh/desertfire/617/head 2025-12-04T09:21:59.9298302Z * [new branch] gh/desertfire/617/orig -> origin/gh/desertfire/617/orig 2025-12-04T09:21:59.9302012Z * [new branch] gh/drisspg/230/base -> origin/gh/drisspg/230/base 2025-12-04T09:21:59.9303795Z * [new branch] gh/drisspg/230/head -> origin/gh/drisspg/230/head 2025-12-04T09:21:59.9305637Z * [new branch] gh/drisspg/230/orig -> origin/gh/drisspg/230/orig 2025-12-04T09:21:59.9309608Z 080672a91bc..35f90ae3c85 gh/eellison/875/base -> origin/gh/eellison/875/base 2025-12-04T09:21:59.9312371Z 4912a077bf3..792469f7284 gh/eellison/875/head -> origin/gh/eellison/875/head 2025-12-04T09:21:59.9314879Z + f63acef9f4e...ef777373806 gh/eellison/875/orig -> origin/gh/eellison/875/orig (forced update) 2025-12-04T09:21:59.9317769Z * [new branch] gh/eellison/879/base -> origin/gh/eellison/879/base 2025-12-04T09:21:59.9319487Z * [new branch] gh/eellison/879/head -> origin/gh/eellison/879/head 2025-12-04T09:21:59.9321343Z * [new branch] gh/eellison/879/orig -> origin/gh/eellison/879/orig 2025-12-04T09:21:59.9324267Z * [new branch] gh/eellison/880/base -> origin/gh/eellison/880/base 2025-12-04T09:21:59.9325746Z * [new branch] gh/eellison/880/head -> origin/gh/eellison/880/head 2025-12-04T09:21:59.9327714Z * [new branch] gh/eellison/880/orig -> origin/gh/eellison/880/orig 2025-12-04T09:21:59.9330632Z * [new branch] gh/eellison/881/base -> origin/gh/eellison/881/base 2025-12-04T09:21:59.9332350Z * [new branch] gh/eellison/881/head -> origin/gh/eellison/881/head 2025-12-04T09:21:59.9334213Z * [new branch] gh/eellison/881/orig -> origin/gh/eellison/881/orig 2025-12-04T09:21:59.9336883Z * [new branch] gh/eellison/882/base -> origin/gh/eellison/882/base 2025-12-04T09:21:59.9338604Z * [new branch] gh/eellison/882/head -> origin/gh/eellison/882/head 2025-12-04T09:21:59.9340646Z * [new branch] gh/eellison/882/orig -> origin/gh/eellison/882/orig 2025-12-04T09:21:59.9343344Z * [new branch] gh/eellison/883/base -> origin/gh/eellison/883/base 2025-12-04T09:21:59.9345072Z * [new branch] gh/eellison/883/head -> origin/gh/eellison/883/head 2025-12-04T09:21:59.9346883Z * [new branch] gh/eellison/883/orig -> origin/gh/eellison/883/orig 2025-12-04T09:21:59.9349477Z * [new branch] gh/eellison/884/base -> origin/gh/eellison/884/base 2025-12-04T09:21:59.9351211Z * [new branch] gh/eellison/884/head -> origin/gh/eellison/884/head 2025-12-04T09:21:59.9352964Z * [new branch] gh/eellison/884/orig -> origin/gh/eellison/884/orig 2025-12-04T09:21:59.9356786Z cd118127f79..81b4b6952dc gh/etaf/173/base -> origin/gh/etaf/173/base 2025-12-04T09:21:59.9359411Z 3c5f17920d9..dd169264b24 gh/etaf/173/head -> origin/gh/etaf/173/head 2025-12-04T09:21:59.9361938Z + 13b4a9c0e66...5c6151f345e gh/etaf/173/orig -> origin/gh/etaf/173/orig (forced update) 2025-12-04T09:21:59.9364463Z dcb1bb649ce..ef511fb998a gh/etaf/175/base -> origin/gh/etaf/175/base 2025-12-04T09:21:59.9367839Z e1ba6de5817..1c7461fafd2 gh/etaf/175/head -> origin/gh/etaf/175/head 2025-12-04T09:21:59.9370172Z + b9f8e3373b0...e2bf9f43f58 gh/etaf/175/orig -> origin/gh/etaf/175/orig (forced update) 2025-12-04T09:21:59.9372845Z 784ec1fee17..eb4baa23ab4 gh/etaf/176/base -> origin/gh/etaf/176/base 2025-12-04T09:21:59.9375395Z 85e88deadbb..c3dee7e86de gh/etaf/176/head -> origin/gh/etaf/176/head 2025-12-04T09:21:59.9377131Z + ca81e1b4644...de97d2bcf41 gh/etaf/176/orig -> origin/gh/etaf/176/orig (forced update) 2025-12-04T09:21:59.9378946Z 55630f695d7..784519bb7df gh/etaf/177/base -> origin/gh/etaf/177/base 2025-12-04T09:21:59.9380584Z c2ba041a135..1a8ffb19b01 gh/etaf/177/head -> origin/gh/etaf/177/head 2025-12-04T09:21:59.9382243Z + 17251289f74...2ff6c1d61e4 gh/etaf/177/orig -> origin/gh/etaf/177/orig (forced update) 2025-12-04T09:21:59.9383998Z b2fc6ee3ff4..b29d2ebb7d9 gh/etaf/178/base -> origin/gh/etaf/178/base 2025-12-04T09:21:59.9385670Z daac151e69a..40844288963 gh/etaf/178/head -> origin/gh/etaf/178/head 2025-12-04T09:21:59.9387100Z + 940470561f0...8879b350357 gh/etaf/178/orig -> origin/gh/etaf/178/orig (forced update) 2025-12-04T09:21:59.9389978Z fdd6a62428b..8080e8932c7 gh/ezyang/3209/head -> origin/gh/ezyang/3209/head 2025-12-04T09:21:59.9391333Z + d0fe26e4532...a22ff6249e1 gh/ezyang/3209/orig -> origin/gh/ezyang/3209/orig (forced update) 2025-12-04T09:21:59.9392916Z 6da65d3f2d6..c46b5408f32 gh/fadara01/9/head -> origin/gh/fadara01/9/head 2025-12-04T09:21:59.9394407Z + fee1b5348eb...6cd4bb5cac6 gh/fadara01/9/orig -> origin/gh/fadara01/9/orig (forced update) 2025-12-04T09:21:59.9396580Z c1e3a296fd6..ffc76107196 gh/fduwjj/239/head -> origin/gh/fduwjj/239/head 2025-12-04T09:21:59.9397773Z + 30b540598be...b595c88900d gh/fduwjj/239/orig -> origin/gh/fduwjj/239/orig (forced update) 2025-12-04T09:21:59.9399291Z 0b3faeba995..b16d6feffa7 gh/fegin/335/head -> origin/gh/fegin/335/head 2025-12-04T09:21:59.9400758Z + f83cdf4accc...cb890a9dff5 gh/fegin/335/orig -> origin/gh/fegin/335/orig (forced update) 2025-12-04T09:21:59.9444345Z f4efae64a6c..84fc3987c9b gh/fffrog/181/base -> origin/gh/fffrog/181/base 2025-12-04T09:21:59.9446663Z 2592d5e401e..4b06f62b9f2 gh/fffrog/181/head -> origin/gh/fffrog/181/head 2025-12-04T09:21:59.9448576Z + f32430702b3...dc3975d6367 gh/fffrog/181/orig -> origin/gh/fffrog/181/orig (forced update) 2025-12-04T09:21:59.9450268Z 8b80eda1a79..1619c3d1471 gh/fxdawnn/12/base -> origin/gh/fxdawnn/12/base 2025-12-04T09:21:59.9451854Z c14595066e7..2c3ad62f7f9 gh/fxdawnn/12/head -> origin/gh/fxdawnn/12/head 2025-12-04T09:21:59.9453434Z + c14dcd1bf7b...7d358a1285b gh/fxdawnn/12/orig -> origin/gh/fxdawnn/12/orig (forced update) 2025-12-04T09:21:59.9455053Z ea07a5b2aa5..f1fd2070467 gh/fxdawnn/14/base -> origin/gh/fxdawnn/14/base 2025-12-04T09:21:59.9456688Z b6b63c0e35a..20f7d9df0bc gh/fxdawnn/14/head -> origin/gh/fxdawnn/14/head 2025-12-04T09:21:59.9458233Z + 04707fa5889...c7fd22a6ada gh/fxdawnn/14/orig -> origin/gh/fxdawnn/14/orig (forced update) 2025-12-04T09:21:59.9459874Z 73d39dcfe59..58c8ee866ab gh/fxdawnn/15/base -> origin/gh/fxdawnn/15/base 2025-12-04T09:21:59.9461478Z cbd90d61d6b..e0376a61634 gh/fxdawnn/15/head -> origin/gh/fxdawnn/15/head 2025-12-04T09:21:59.9463115Z + 2135c94d546...1c42e05e823 gh/fxdawnn/15/orig -> origin/gh/fxdawnn/15/orig (forced update) 2025-12-04T09:21:59.9465668Z becb6c1b3e5..8c6d6b90881 gh/guangyey/178/base -> origin/gh/guangyey/178/base 2025-12-04T09:21:59.9467379Z 0f132dedafc..29367a43881 gh/guangyey/178/head -> origin/gh/guangyey/178/head 2025-12-04T09:21:59.9469169Z + c19210e8b3f...9068d50bc9e gh/guangyey/178/orig -> origin/gh/guangyey/178/orig (forced update) 2025-12-04T09:21:59.9471955Z fe6b3de20eb..7dbb3ef1bad gh/guangyey/244/base -> origin/gh/guangyey/244/base 2025-12-04T09:21:59.9473242Z b5b1bc1e9f9..11f752e229a gh/guangyey/244/head -> origin/gh/guangyey/244/head 2025-12-04T09:21:59.9474855Z + 4b3ece481f0...70bb0887f11 gh/guangyey/244/orig -> origin/gh/guangyey/244/orig (forced update) 2025-12-04T09:21:59.9476686Z 996100f029a..d87c03f1fbe gh/guangyey/250/head -> origin/gh/guangyey/250/head 2025-12-04T09:21:59.9478251Z + 9b993a290b0...19bd201c1e7 gh/guangyey/250/orig -> origin/gh/guangyey/250/orig (forced update) 2025-12-04T09:21:59.9479626Z 901a7fb6123..9a0910359d0 gh/guangyey/251/base -> origin/gh/guangyey/251/base 2025-12-04T09:21:59.9481156Z ff3e11d227b..3a38261dceb gh/guangyey/251/head -> origin/gh/guangyey/251/head 2025-12-04T09:21:59.9482720Z + ec60d9fdb94...d589569e302 gh/guangyey/251/orig -> origin/gh/guangyey/251/orig (forced update) 2025-12-04T09:21:59.9484505Z dde53aab961..4c806264057 gh/guangyey/252/base -> origin/gh/guangyey/252/base 2025-12-04T09:21:59.9486343Z 679ff04d746..b73516aa8ad gh/guangyey/252/head -> origin/gh/guangyey/252/head 2025-12-04T09:21:59.9489227Z + a97a6714fa8...f6134bca8bf gh/guangyey/252/orig -> origin/gh/guangyey/252/orig (forced update) 2025-12-04T09:21:59.9490730Z 92254a44b20..81a2f81e4d0 gh/guangyey/253/base -> origin/gh/guangyey/253/base 2025-12-04T09:21:59.9492416Z f7f17c1e1d6..c8ca80d38f8 gh/guangyey/253/head -> origin/gh/guangyey/253/head 2025-12-04T09:21:59.9493947Z + c49170970ad...e1e73fc9845 gh/guangyey/253/orig -> origin/gh/guangyey/253/orig (forced update) 2025-12-04T09:21:59.9495501Z 94200138a6a..2884cf56063 gh/guangyey/254/base -> origin/gh/guangyey/254/base 2025-12-04T09:21:59.9497462Z 30bbaba5cd3..2fef5d598d1 gh/guangyey/254/head -> origin/gh/guangyey/254/head 2025-12-04T09:21:59.9498790Z + 59d1816d9f3...36cb426e77c gh/guangyey/254/orig -> origin/gh/guangyey/254/orig (forced update) 2025-12-04T09:21:59.9500324Z d211b75883e..293414b65c3 gh/guangyey/255/base -> origin/gh/guangyey/255/base 2025-12-04T09:21:59.9501999Z 6875b2bfc11..98e218a7c74 gh/guangyey/255/head -> origin/gh/guangyey/255/head 2025-12-04T09:21:59.9503602Z + 95b06629167...0f75619eedd gh/guangyey/255/orig -> origin/gh/guangyey/255/orig (forced update) 2025-12-04T09:21:59.9505458Z 1389ba0249a..72913d5c641 gh/guilhermeleobas/193/base -> origin/gh/guilhermeleobas/193/base 2025-12-04T09:21:59.9506802Z 29b7e057e03..26a40bcadfb gh/guilhermeleobas/193/head -> origin/gh/guilhermeleobas/193/head 2025-12-04T09:21:59.9508459Z + a57a64403cd...fe9f9a5aa5e gh/guilhermeleobas/193/orig -> origin/gh/guilhermeleobas/193/orig (forced update) 2025-12-04T09:21:59.9510410Z 5bb77363e2d..172630bc8ae gh/guilhermeleobas/264/base -> origin/gh/guilhermeleobas/264/base 2025-12-04T09:21:59.9511797Z 6678f759ada..c47798244b1 gh/guilhermeleobas/264/head -> origin/gh/guilhermeleobas/264/head 2025-12-04T09:21:59.9513194Z + 364605293c7...a6bb71ac02b gh/guilhermeleobas/264/orig -> origin/gh/guilhermeleobas/264/orig (forced update) 2025-12-04T09:21:59.9514667Z 6b66029732c..187cbb0bc8e gh/guilhermeleobas/267/head -> origin/gh/guilhermeleobas/267/head 2025-12-04T09:21:59.9516164Z + 3c4ba920f96...57faa206cc1 gh/guilhermeleobas/267/orig -> origin/gh/guilhermeleobas/267/orig (forced update) 2025-12-04T09:21:59.9517705Z 5252af4e1c1..77ad1a3e827 gh/isuruf/146/head -> origin/gh/isuruf/146/head 2025-12-04T09:21:59.9519217Z + 676a91e64d0...4533c12835f gh/isuruf/146/orig -> origin/gh/isuruf/146/orig (forced update) 2025-12-04T09:21:59.9522505Z 808cddef436..eb64b494986 gh/jansel/568/base -> origin/gh/jansel/568/base 2025-12-04T09:21:59.9523822Z e44d70911b9..11cc61beaad gh/jansel/568/head -> origin/gh/jansel/568/head 2025-12-04T09:21:59.9525314Z + 172135ba46d...e0936d73294 gh/jansel/568/orig -> origin/gh/jansel/568/orig (forced update) 2025-12-04T09:21:59.9526911Z 078768687fe..c520d51bcde gh/jansel/570/base -> origin/gh/jansel/570/base 2025-12-04T09:21:59.9528427Z 9faa9b1e432..ac88feac91d gh/jansel/570/head -> origin/gh/jansel/570/head 2025-12-04T09:21:59.9530082Z + 032fcc06400...6cba3ed5e99 gh/jansel/570/orig -> origin/gh/jansel/570/orig (forced update) 2025-12-04T09:21:59.9531633Z 076804a051a..212a6564c21 gh/jansel/573/base -> origin/gh/jansel/573/base 2025-12-04T09:21:59.9533129Z 24b299805dd..0569a20388c gh/jansel/573/head -> origin/gh/jansel/573/head 2025-12-04T09:21:59.9534613Z + b9beda38955...0473cb4b4e4 gh/jansel/573/orig -> origin/gh/jansel/573/orig (forced update) 2025-12-04T09:21:59.9536084Z aa0da2ebe62..adc9cc53c18 gh/jansel/574/base -> origin/gh/jansel/574/base 2025-12-04T09:21:59.9537547Z 4004866c85e..cd618a29a7b gh/jansel/574/head -> origin/gh/jansel/574/head 2025-12-04T09:21:59.9539091Z + cf66995833d...bbee6261ff9 gh/jansel/574/orig -> origin/gh/jansel/574/orig (forced update) 2025-12-04T09:21:59.9541563Z * [new branch] gh/jansel/575/base -> origin/gh/jansel/575/base 2025-12-04T09:21:59.9542272Z * [new branch] gh/jansel/575/head -> origin/gh/jansel/575/head 2025-12-04T09:21:59.9543417Z * [new branch] gh/jansel/575/orig -> origin/gh/jansel/575/orig 2025-12-04T09:21:59.9545105Z * [new branch] gh/jansel/576/base -> origin/gh/jansel/576/base 2025-12-04T09:21:59.9546199Z * [new branch] gh/jansel/576/head -> origin/gh/jansel/576/head 2025-12-04T09:21:59.9547211Z * [new branch] gh/jansel/576/orig -> origin/gh/jansel/576/orig 2025-12-04T09:21:59.9549612Z 5fa94c3e8da..c3983c5b825 gh/jerryzh168/1/base -> origin/gh/jerryzh168/1/base 2025-12-04T09:21:59.9550929Z 51bc6d323a6..2e5ba88ce46 gh/jerryzh168/1/head -> origin/gh/jerryzh168/1/head 2025-12-04T09:21:59.9552446Z + 295ec6464e4...656e2e6da7a gh/jerryzh168/1/orig -> origin/gh/jerryzh168/1/orig (forced update) 2025-12-04T09:21:59.9555305Z 1497fb9f170..f820f5574cc gh/karthickai/12/base -> origin/gh/karthickai/12/base 2025-12-04T09:21:59.9556706Z f57d2c27e7b..4195b285002 gh/karthickai/12/head -> origin/gh/karthickai/12/head 2025-12-04T09:21:59.9558413Z + 098a425a9dd...1f0666ff440 gh/karthickai/12/orig -> origin/gh/karthickai/12/orig (forced update) 2025-12-04T09:21:59.9559838Z 42b7b07b188..c4f12da1bd1 gh/karthickai/14/base -> origin/gh/karthickai/14/base 2025-12-04T09:21:59.9561363Z e396f0c26aa..5f87790cfe5 gh/karthickai/14/head -> origin/gh/karthickai/14/head 2025-12-04T09:21:59.9563059Z + f3cbd74b343...cbd4e08f13f gh/karthickai/14/orig -> origin/gh/karthickai/14/orig (forced update) 2025-12-04T09:21:59.9564424Z 0c192133e28..c14e8719770 gh/karthickai/16/base -> origin/gh/karthickai/16/base 2025-12-04T09:21:59.9565898Z 8ade28715f8..4adace586e4 gh/karthickai/16/head -> origin/gh/karthickai/16/head 2025-12-04T09:21:59.9567764Z + da19cb63ec1...caddd81a6c4 gh/karthickai/16/orig -> origin/gh/karthickai/16/orig (forced update) 2025-12-04T09:21:59.9569019Z 54cdf098b98..d5bdef8a198 gh/karthickai/17/base -> origin/gh/karthickai/17/base 2025-12-04T09:21:59.9570451Z 648e5393628..bc14f6e367e gh/karthickai/17/head -> origin/gh/karthickai/17/head 2025-12-04T09:21:59.9571937Z + 1e964e36f04...efc93f33afc gh/karthickai/17/orig -> origin/gh/karthickai/17/orig (forced update) 2025-12-04T09:21:59.9573978Z 794da4b8d31..abe882d145c gh/karthickai/20/base -> origin/gh/karthickai/20/base 2025-12-04T09:21:59.9576027Z 7875098ff90..149b378b8c0 gh/karthickai/20/head -> origin/gh/karthickai/20/head 2025-12-04T09:21:59.9577516Z + 5539dd24501...2355c0e68e2 gh/karthickai/20/orig -> origin/gh/karthickai/20/orig (forced update) 2025-12-04T09:21:59.9579266Z f8f960b8c75..6bb07dfb0da gh/karthickai/21/base -> origin/gh/karthickai/21/base 2025-12-04T09:21:59.9581058Z 3080d3c4858..ba7ff855554 gh/karthickai/21/head -> origin/gh/karthickai/21/head 2025-12-04T09:21:59.9582543Z + 58e0c6157c7...010f721dcde gh/karthickai/21/orig -> origin/gh/karthickai/21/orig (forced update) 2025-12-04T09:21:59.9584211Z ed9b9748cb6..f3149587ff1 gh/karthickai/24/base -> origin/gh/karthickai/24/base 2025-12-04T09:21:59.9585756Z 8b51488628f..7d88155ea7d gh/karthickai/24/head -> origin/gh/karthickai/24/head 2025-12-04T09:21:59.9587255Z + df8c222c239...f10d47523a9 gh/karthickai/24/orig -> origin/gh/karthickai/24/orig (forced update) 2025-12-04T09:21:59.9588867Z * [new branch] gh/karthickai/26/base -> origin/gh/karthickai/26/base 2025-12-04T09:21:59.9590069Z * [new branch] gh/karthickai/26/head -> origin/gh/karthickai/26/head 2025-12-04T09:21:59.9591103Z * [new branch] gh/karthickai/26/orig -> origin/gh/karthickai/26/orig 2025-12-04T09:21:59.9593397Z * [new branch] gh/krocki/1/base -> origin/gh/krocki/1/base 2025-12-04T09:21:59.9630266Z * [new branch] gh/krocki/1/head -> origin/gh/krocki/1/head 2025-12-04T09:21:59.9630714Z * [new branch] gh/krocki/1/orig -> origin/gh/krocki/1/orig 2025-12-04T09:21:59.9631102Z * [new branch] gh/krocki/2/base -> origin/gh/krocki/2/base 2025-12-04T09:21:59.9631485Z * [new branch] gh/krocki/2/head -> origin/gh/krocki/2/head 2025-12-04T09:21:59.9631871Z * [new branch] gh/krocki/2/orig -> origin/gh/krocki/2/orig 2025-12-04T09:21:59.9632313Z 873a5b2fbe5..490302e9a32 gh/kurtamohler/61/base -> origin/gh/kurtamohler/61/base 2025-12-04T09:21:59.9633086Z 17abcb6ab9a..6e671c36fbe gh/kurtamohler/61/head -> origin/gh/kurtamohler/61/head 2025-12-04T09:21:59.9633622Z + b81ec383bc3...32795ec950a gh/kurtamohler/61/orig -> origin/gh/kurtamohler/61/orig (forced update) 2025-12-04T09:21:59.9634148Z 899ad273052..e87deb21c05 gh/kurtamohler/63/base -> origin/gh/kurtamohler/63/base 2025-12-04T09:21:59.9634619Z c87648a1382..99d7c708304 gh/kurtamohler/63/head -> origin/gh/kurtamohler/63/head 2025-12-04T09:21:59.9635127Z + 23bd6f8eacf...521d95f8d2f gh/kurtamohler/63/orig -> origin/gh/kurtamohler/63/orig (forced update) 2025-12-04T09:21:59.9635643Z fc22e4dedfd..173900d1916 gh/kurtamohler/64/base -> origin/gh/kurtamohler/64/base 2025-12-04T09:21:59.9636115Z c4e1675e724..016d4c99f62 gh/kurtamohler/64/head -> origin/gh/kurtamohler/64/head 2025-12-04T09:21:59.9636625Z + fb296fb2bc6...dc74d0ee749 gh/kurtamohler/64/orig -> origin/gh/kurtamohler/64/orig (forced update) 2025-12-04T09:21:59.9637316Z 7b540a2e1e5..97c322dc605 gh/kurtamohler/65/base -> origin/gh/kurtamohler/65/base 2025-12-04T09:21:59.9637790Z 00a87c3639f..b3d8616704a gh/kurtamohler/65/head -> origin/gh/kurtamohler/65/head 2025-12-04T09:21:59.9638316Z + bd7387a5fa6...ad78e74e5ee gh/kurtamohler/65/orig -> origin/gh/kurtamohler/65/orig (forced update) 2025-12-04T09:21:59.9638833Z 7661aacb65a..0fa3e9f39ee gh/kurtamohler/66/base -> origin/gh/kurtamohler/66/base 2025-12-04T09:21:59.9639304Z 4dd821ae2e0..4ab9d707231 gh/kurtamohler/66/head -> origin/gh/kurtamohler/66/head 2025-12-04T09:21:59.9639806Z + d6a3880d793...17c295cb6b4 gh/kurtamohler/66/orig -> origin/gh/kurtamohler/66/orig (forced update) 2025-12-04T09:21:59.9640285Z * [new branch] gh/kurtamohler/67/base -> origin/gh/kurtamohler/67/base 2025-12-04T09:21:59.9640692Z * [new branch] gh/kurtamohler/67/head -> origin/gh/kurtamohler/67/head 2025-12-04T09:21:59.9641092Z * [new branch] gh/kurtamohler/67/orig -> origin/gh/kurtamohler/67/orig 2025-12-04T09:21:59.9641510Z * [new branch] gh/masnesral/1/base -> origin/gh/masnesral/1/base 2025-12-04T09:21:59.9641972Z * [new branch] gh/masnesral/1/head -> origin/gh/masnesral/1/head 2025-12-04T09:21:59.9642356Z * [new branch] gh/masnesral/1/orig -> origin/gh/masnesral/1/orig 2025-12-04T09:21:59.9642827Z 5778f6ff894..64b551ea4c0 gh/mikaylagawarecki/391/base -> origin/gh/mikaylagawarecki/391/base 2025-12-04T09:21:59.9643373Z b77b60a1f03..06c009bcdb7 gh/mikaylagawarecki/391/head -> origin/gh/mikaylagawarecki/391/head 2025-12-04T09:21:59.9643976Z + 4bbbb2325ec...9b6bcf41913 gh/mikaylagawarecki/391/orig -> origin/gh/mikaylagawarecki/391/orig (forced update) 2025-12-04T09:21:59.9644556Z * [new branch] gh/mikaylagawarecki/392/base -> origin/gh/mikaylagawarecki/392/base 2025-12-04T09:21:59.9645046Z * [new branch] gh/mikaylagawarecki/392/head -> origin/gh/mikaylagawarecki/392/head 2025-12-04T09:21:59.9645512Z * [new branch] gh/mikaylagawarecki/392/orig -> origin/gh/mikaylagawarecki/392/orig 2025-12-04T09:21:59.9648308Z 4867e16da64..eb6fda2184a gh/mlazos/71/head -> origin/gh/mlazos/71/head 2025-12-04T09:21:59.9649828Z + 9ef7ff92990...989d05b66d5 gh/mlazos/71/orig -> origin/gh/mlazos/71/orig (forced update) 2025-12-04T09:21:59.9651246Z 51c86186d80..320ffa40588 gh/mlazos/72/base -> origin/gh/mlazos/72/base 2025-12-04T09:21:59.9652744Z 3b41b9a0349..9491e0d4426 gh/mlazos/72/head -> origin/gh/mlazos/72/head 2025-12-04T09:21:59.9654274Z + 173630c957d...c3514c10842 gh/mlazos/72/orig -> origin/gh/mlazos/72/orig (forced update) 2025-12-04T09:21:59.9655891Z * [new branch] gh/mlazos/73/base -> origin/gh/mlazos/73/base 2025-12-04T09:21:59.9657525Z * [new branch] gh/mlazos/73/head -> origin/gh/mlazos/73/head 2025-12-04T09:21:59.9658844Z * [new branch] gh/mlazos/73/orig -> origin/gh/mlazos/73/orig 2025-12-04T09:21:59.9660664Z 840eed987cf..10bf5bad3d9 gh/nikitaved/13/base -> origin/gh/nikitaved/13/base 2025-12-04T09:21:59.9662124Z 215ce1799d1..1a36a4abee9 gh/nikitaved/13/head -> origin/gh/nikitaved/13/head 2025-12-04T09:21:59.9663690Z + afb7c9edc8a...62a0f999ec5 gh/nikitaved/13/orig -> origin/gh/nikitaved/13/orig (forced update) 2025-12-04T09:21:59.9665384Z * [new branch] gh/nikitaved/14/base -> origin/gh/nikitaved/14/base 2025-12-04T09:21:59.9666460Z * [new branch] gh/nikitaved/14/head -> origin/gh/nikitaved/14/head 2025-12-04T09:21:59.9667560Z * [new branch] gh/nikitaved/14/orig -> origin/gh/nikitaved/14/orig 2025-12-04T09:21:59.9669063Z * [new branch] gh/nikitaved/15/base -> origin/gh/nikitaved/15/base 2025-12-04T09:21:59.9670240Z * [new branch] gh/nikitaved/15/head -> origin/gh/nikitaved/15/head 2025-12-04T09:21:59.9671349Z * [new branch] gh/nikitaved/15/orig -> origin/gh/nikitaved/15/orig 2025-12-04T09:21:59.9672889Z * [new branch] gh/nikitaved/16/base -> origin/gh/nikitaved/16/base 2025-12-04T09:21:59.9674035Z * [new branch] gh/nikitaved/16/head -> origin/gh/nikitaved/16/head 2025-12-04T09:21:59.9675113Z * [new branch] gh/nikitaved/16/orig -> origin/gh/nikitaved/16/orig 2025-12-04T09:21:59.9676728Z 565a4eb187f..c7b4c292a41 gh/nikitaved/4/base -> origin/gh/nikitaved/4/base 2025-12-04T09:21:59.9678129Z 514b040928f..84c1f99b0f5 gh/nikitaved/4/head -> origin/gh/nikitaved/4/head 2025-12-04T09:21:59.9679613Z + 50f767e6907...31993d37d48 gh/nikitaved/4/orig -> origin/gh/nikitaved/4/orig (forced update) 2025-12-04T09:21:59.9681993Z 3d1dedcd17e..e25f0501860 gh/oulgen/26/base -> origin/gh/oulgen/26/base 2025-12-04T09:21:59.9683148Z 5640ca564e4..269693d23a8 gh/oulgen/26/head -> origin/gh/oulgen/26/head 2025-12-04T09:21:59.9684644Z + 7801f93c1b3...b512d5ef730 gh/oulgen/26/orig -> origin/gh/oulgen/26/orig (forced update) 2025-12-04T09:21:59.9687198Z 6a84331ef1f..f4ea3e7f638 gh/pearu/156/base -> origin/gh/pearu/156/base 2025-12-04T09:21:59.9688457Z 89678969757..1750b3d27df gh/pearu/156/head -> origin/gh/pearu/156/head 2025-12-04T09:21:59.9690045Z + ea4d6a6e6c1...18c028053b6 gh/pearu/156/orig -> origin/gh/pearu/156/orig (forced update) 2025-12-04T09:21:59.9692131Z a1510096b58..5abb1ec9620 gh/pianpwk/34/base -> origin/gh/pianpwk/34/base 2025-12-04T09:21:59.9693816Z 9274ad1656f..ab5ed963922 gh/pianpwk/34/head -> origin/gh/pianpwk/34/head 2025-12-04T09:21:59.9695532Z + 377c5ade373...b10d6d1407f gh/pianpwk/34/orig -> origin/gh/pianpwk/34/orig (forced update) 2025-12-04T09:21:59.9696984Z f0878456242..9aab29c37ee gh/pianpwk/35/base -> origin/gh/pianpwk/35/base 2025-12-04T09:21:59.9698517Z 5c38c848d72..f05bc291538 gh/pianpwk/35/head -> origin/gh/pianpwk/35/head 2025-12-04T09:21:59.9700263Z + b7130e4db37...1509a915f5b gh/pianpwk/35/orig -> origin/gh/pianpwk/35/orig (forced update) 2025-12-04T09:21:59.9702166Z ba9aaba96ca..802954e7384 gh/rec/166/base -> origin/gh/rec/166/base 2025-12-04T09:21:59.9703816Z d78765019f7..e8e621f56f5 gh/rec/166/head -> origin/gh/rec/166/head 2025-12-04T09:21:59.9705520Z + 26dc95e323b...67b568b7c8e gh/rec/166/orig -> origin/gh/rec/166/orig (forced update) 2025-12-04T09:21:59.9706886Z 9b0e46b797a..822604e124e gh/rec/167/base -> origin/gh/rec/167/base 2025-12-04T09:21:59.9708472Z dc8e47dee96..885db95dd05 gh/rec/167/head -> origin/gh/rec/167/head 2025-12-04T09:21:59.9710040Z + 0c5abea5ba3...a15f4379258 gh/rec/167/orig -> origin/gh/rec/167/orig (forced update) 2025-12-04T09:21:59.9711619Z ddf30fd3aff..003e19785b2 gh/rec/170/base -> origin/gh/rec/170/base 2025-12-04T09:21:59.9714471Z 8ecbf8919ae..c7581466eab gh/rec/170/head -> origin/gh/rec/170/head 2025-12-04T09:21:59.9715371Z + 10d81100e41...3bf4ac36af3 gh/rec/170/orig -> origin/gh/rec/170/orig (forced update) 2025-12-04T09:21:59.9716256Z 959d91b2047..e1d7f1c7416 gh/rec/174/base -> origin/gh/rec/174/base 2025-12-04T09:21:59.9717819Z b2abae77ab0..20c728b958a gh/rec/174/head -> origin/gh/rec/174/head 2025-12-04T09:21:59.9719792Z + 611ea71b351...5aaae785245 gh/rec/174/orig -> origin/gh/rec/174/orig (forced update) 2025-12-04T09:21:59.9721053Z 9ea7c4c5cb4..d302288fa27 gh/rec/176/base -> origin/gh/rec/176/base 2025-12-04T09:21:59.9722444Z 5074ffc5931..48ec33d5a07 gh/rec/176/head -> origin/gh/rec/176/head 2025-12-04T09:21:59.9723928Z + 8b606954b44...e68af845fd0 gh/rec/176/orig -> origin/gh/rec/176/orig (forced update) 2025-12-04T09:21:59.9725473Z * [new branch] gh/rec/177/base -> origin/gh/rec/177/base 2025-12-04T09:21:59.9726783Z * [new branch] gh/rec/177/head -> origin/gh/rec/177/head 2025-12-04T09:21:59.9727957Z * [new branch] gh/rec/177/orig -> origin/gh/rec/177/orig 2025-12-04T09:21:59.9730430Z + a025aa9a7ce...1712c29f453 gh/rtimpe/35/base -> origin/gh/rtimpe/35/base (forced update) 2025-12-04T09:21:59.9732017Z + ebfd9fb0ff3...443d8a93c7c gh/rtimpe/35/head -> origin/gh/rtimpe/35/head (forced update) 2025-12-04T09:21:59.9733616Z + ed8ad8555f2...e4156457ef0 gh/rtimpe/35/orig -> origin/gh/rtimpe/35/orig (forced update) 2025-12-04T09:21:59.9735818Z + 2bdf80bae7d...a64bbe22394 gh/seemethere/74/orig -> origin/gh/seemethere/74/orig (forced update) 2025-12-04T09:21:59.9737309Z + c6ef08d858f...bda6d3d8870 gh/seemethere/75/orig -> origin/gh/seemethere/75/orig (forced update) 2025-12-04T09:21:59.9738998Z * [new branch] gh/seemethere/76/base -> origin/gh/seemethere/76/base 2025-12-04T09:21:59.9740199Z * [new branch] gh/seemethere/76/head -> origin/gh/seemethere/76/head 2025-12-04T09:21:59.9741402Z * [new branch] gh/seemethere/76/orig -> origin/gh/seemethere/76/orig 2025-12-04T09:21:59.9774005Z 0055f079976..afb0b45536c gh/tugsbayasgalan/55/base -> origin/gh/tugsbayasgalan/55/base 2025-12-04T09:21:59.9776285Z 9df1890b1e1..b7d8b06e8f2 gh/tugsbayasgalan/55/head -> origin/gh/tugsbayasgalan/55/head 2025-12-04T09:21:59.9778589Z + 6fa9bd6128b...1c4132f636d gh/tugsbayasgalan/55/orig -> origin/gh/tugsbayasgalan/55/orig (forced update) 2025-12-04T09:21:59.9780926Z 68f8429141e..20411280e60 gh/tugsbayasgalan/71/base -> origin/gh/tugsbayasgalan/71/base 2025-12-04T09:21:59.9783062Z 91233b61705..6b5391ec1f8 gh/tugsbayasgalan/71/head -> origin/gh/tugsbayasgalan/71/head 2025-12-04T09:21:59.9785047Z + 518d9532b87...3bb02cee4c5 gh/tugsbayasgalan/71/orig -> origin/gh/tugsbayasgalan/71/orig (forced update) 2025-12-04T09:21:59.9787241Z 73e93fca63e..6bc2a9c64e1 gh/tugsbayasgalan/91/base -> origin/gh/tugsbayasgalan/91/base 2025-12-04T09:21:59.9788803Z 4c2cfd76fca..e371f99d141 gh/tugsbayasgalan/91/head -> origin/gh/tugsbayasgalan/91/head 2025-12-04T09:21:59.9790455Z + c9f966f1661...2e7ec5a1733 gh/tugsbayasgalan/91/orig -> origin/gh/tugsbayasgalan/91/orig (forced update) 2025-12-04T09:21:59.9791964Z 75374cbca4a..4261eed5a23 gh/tugsbayasgalan/92/head -> origin/gh/tugsbayasgalan/92/head 2025-12-04T09:21:59.9793477Z + 43292eabf1a...79024692022 gh/tugsbayasgalan/92/orig -> origin/gh/tugsbayasgalan/92/orig (forced update) 2025-12-04T09:21:59.9795282Z * [new branch] gh/tugsbayasgalan/93/base -> origin/gh/tugsbayasgalan/93/base 2025-12-04T09:21:59.9796463Z * [new branch] gh/tugsbayasgalan/93/head -> origin/gh/tugsbayasgalan/93/head 2025-12-04T09:21:59.9797548Z * [new branch] gh/tugsbayasgalan/93/orig -> origin/gh/tugsbayasgalan/93/orig 2025-12-04T09:21:59.9800541Z 158e2af2128..41b946af59a gh/wconstab/456/base -> origin/gh/wconstab/456/base 2025-12-04T09:21:59.9802463Z 703ddf0eb4e..531b5d5ec47 gh/wconstab/456/head -> origin/gh/wconstab/456/head 2025-12-04T09:21:59.9804081Z + e0dff8b0776...1fc855d8264 gh/wconstab/456/orig -> origin/gh/wconstab/456/orig (forced update) 2025-12-04T09:21:59.9806028Z fad803b93dc..94495834534 gh/wconstab/458/base -> origin/gh/wconstab/458/base 2025-12-04T09:21:59.9808743Z 68c2f8f3c33..12d36c49fe6 gh/wconstab/458/head -> origin/gh/wconstab/458/head 2025-12-04T09:21:59.9810320Z + a63ed1ed1be...6fc3ac45eed gh/wconstab/458/orig -> origin/gh/wconstab/458/orig (forced update) 2025-12-04T09:21:59.9811995Z 8c2aa401719..f254b7579f7 gh/wconstab/459/base -> origin/gh/wconstab/459/base 2025-12-04T09:21:59.9813791Z a56f283f54a..0bc719692a7 gh/wconstab/459/head -> origin/gh/wconstab/459/head 2025-12-04T09:21:59.9815353Z + 033ee127ee4...249e4b26902 gh/wconstab/459/orig -> origin/gh/wconstab/459/orig (forced update) 2025-12-04T09:21:59.9817420Z 94865c415ab..6354d33a6c1 gh/wconstab/460/base -> origin/gh/wconstab/460/base 2025-12-04T09:21:59.9819370Z 5040c209fd1..87b475d75c2 gh/wconstab/460/head -> origin/gh/wconstab/460/head 2025-12-04T09:21:59.9821290Z + d7810b39802...2464f9005e4 gh/wconstab/460/orig -> origin/gh/wconstab/460/orig (forced update) 2025-12-04T09:21:59.9822711Z 703ddf0eb4e..009624ec615 gh/wconstab/465/base -> origin/gh/wconstab/465/base 2025-12-04T09:21:59.9824228Z 5c08deeb731..c6217821ad8 gh/wconstab/465/head -> origin/gh/wconstab/465/head 2025-12-04T09:21:59.9825696Z + 7fa72c6e36c...b1ef691dfaa gh/wconstab/465/orig -> origin/gh/wconstab/465/orig (forced update) 2025-12-04T09:21:59.9827781Z fd14c10030a..3c0d7535ea3 gh/wconstab/466/base -> origin/gh/wconstab/466/base 2025-12-04T09:21:59.9828827Z 1113063fd4d..f72f5c1298f gh/wconstab/466/head -> origin/gh/wconstab/466/head 2025-12-04T09:21:59.9830332Z + 1a65d268f9d...358f191c43c gh/wconstab/466/orig -> origin/gh/wconstab/466/orig (forced update) 2025-12-04T09:21:59.9831932Z * [new branch] gh/wconstab/467/base -> origin/gh/wconstab/467/base 2025-12-04T09:21:59.9833159Z * [new branch] gh/wconstab/467/head -> origin/gh/wconstab/467/head 2025-12-04T09:21:59.9834414Z * [new branch] gh/wconstab/467/orig -> origin/gh/wconstab/467/orig 2025-12-04T09:21:59.9836040Z * [new branch] gh/wconstab/468/base -> origin/gh/wconstab/468/base 2025-12-04T09:21:59.9837227Z * [new branch] gh/wconstab/468/head -> origin/gh/wconstab/468/head 2025-12-04T09:21:59.9838293Z * [new branch] gh/wconstab/468/orig -> origin/gh/wconstab/468/orig 2025-12-04T09:21:59.9840113Z b79e95056a1..7f5f4abc192 gh/weifengpy/39/base -> origin/gh/weifengpy/39/base 2025-12-04T09:21:59.9841517Z 341d7106631..6f45dfd2dfe gh/weifengpy/39/head -> origin/gh/weifengpy/39/head 2025-12-04T09:21:59.9843130Z + 89108dd0d68...2cc7fbbc86a gh/weifengpy/39/orig -> origin/gh/weifengpy/39/orig (forced update) 2025-12-04T09:21:59.9845546Z 4236a5f011a..b25fd08c83a gh/williamwen42/296/base -> origin/gh/williamwen42/296/base 2025-12-04T09:21:59.9847353Z af193117437..fa9cef9ff3c gh/williamwen42/296/head -> origin/gh/williamwen42/296/head 2025-12-04T09:21:59.9849011Z + 9d3e9994eaa...1108639906f gh/williamwen42/296/orig -> origin/gh/williamwen42/296/orig (forced update) 2025-12-04T09:21:59.9850734Z 34470f63391..42eddd4456d gh/williamwen42/297/base -> origin/gh/williamwen42/297/base 2025-12-04T09:21:59.9852732Z 021017a0f28..a817c89e4cc gh/williamwen42/297/head -> origin/gh/williamwen42/297/head 2025-12-04T09:21:59.9854083Z + 0348e4398d2...83025dc3c22 gh/williamwen42/297/orig -> origin/gh/williamwen42/297/orig (forced update) 2025-12-04T09:21:59.9855827Z 162577e9273..f5c81aac721 gh/williamwen42/310/base -> origin/gh/williamwen42/310/base 2025-12-04T09:21:59.9857612Z a6dd8387947..ca0f9aef749 gh/williamwen42/310/head -> origin/gh/williamwen42/310/head 2025-12-04T09:21:59.9859060Z + 95148f1ff40...6d5f52ac20d gh/williamwen42/310/orig -> origin/gh/williamwen42/310/orig (forced update) 2025-12-04T09:21:59.9861579Z 7e51a23955d..94d6129d2ca gh/williamwen42/348/base -> origin/gh/williamwen42/348/base 2025-12-04T09:21:59.9863143Z abcb9adc30d..9c12cc4e2d6 gh/williamwen42/348/head -> origin/gh/williamwen42/348/head 2025-12-04T09:21:59.9864632Z + 439d7ae9104...06f2e4f3032 gh/williamwen42/348/orig -> origin/gh/williamwen42/348/orig (forced update) 2025-12-04T09:21:59.9866252Z 0834960aeca..bf13dde6c6b gh/williamwen42/351/base -> origin/gh/williamwen42/351/base 2025-12-04T09:21:59.9867832Z c17bf10c91e..f92e1994896 gh/williamwen42/351/head -> origin/gh/williamwen42/351/head 2025-12-04T09:21:59.9869300Z + cfdd6f12d38...b3fad884c5b gh/williamwen42/351/orig -> origin/gh/williamwen42/351/orig (forced update) 2025-12-04T09:21:59.9870747Z 7caf1540b20..3d34fc0288c gh/williamwen42/352/base -> origin/gh/williamwen42/352/base 2025-12-04T09:21:59.9872190Z 729c5004850..c317ce5143b gh/williamwen42/352/head -> origin/gh/williamwen42/352/head 2025-12-04T09:21:59.9873736Z + ee027188a99...c13e25f0623 gh/williamwen42/352/orig -> origin/gh/williamwen42/352/orig (forced update) 2025-12-04T09:21:59.9875259Z 2333483436f..2fc2f1d43ce gh/williamwen42/353/base -> origin/gh/williamwen42/353/base 2025-12-04T09:21:59.9876741Z 8db715ab37f..e0c11b85fd2 gh/williamwen42/353/head -> origin/gh/williamwen42/353/head 2025-12-04T09:21:59.9878257Z + 1af760616ce...e0df7521ef5 gh/williamwen42/353/orig -> origin/gh/williamwen42/353/orig (forced update) 2025-12-04T09:21:59.9879873Z 18d5daeb16e..6ae99d08a46 gh/williamwen42/354/base -> origin/gh/williamwen42/354/base 2025-12-04T09:21:59.9881391Z 108f62c2692..8cc42eeb789 gh/williamwen42/354/head -> origin/gh/williamwen42/354/head 2025-12-04T09:21:59.9882940Z + 80194d19904...d4aa7b96e28 gh/williamwen42/354/orig -> origin/gh/williamwen42/354/orig (forced update) 2025-12-04T09:21:59.9884646Z * [new branch] gh/williamwen42/356/base -> origin/gh/williamwen42/356/base 2025-12-04T09:21:59.9885825Z * [new branch] gh/williamwen42/356/head -> origin/gh/williamwen42/356/head 2025-12-04T09:21:59.9887109Z * [new branch] gh/williamwen42/356/orig -> origin/gh/williamwen42/356/orig 2025-12-04T09:21:59.9888797Z * [new branch] gh/williamwen42/357/base -> origin/gh/williamwen42/357/base 2025-12-04T09:21:59.9889973Z * [new branch] gh/williamwen42/357/head -> origin/gh/williamwen42/357/head 2025-12-04T09:21:59.9891080Z * [new branch] gh/williamwen42/357/orig -> origin/gh/williamwen42/357/orig 2025-12-04T09:21:59.9892771Z * [new branch] gh/williamwen42/358/base -> origin/gh/williamwen42/358/base 2025-12-04T09:21:59.9893925Z * [new branch] gh/williamwen42/358/head -> origin/gh/williamwen42/358/head 2025-12-04T09:21:59.9895353Z * [new branch] gh/williamwen42/358/orig -> origin/gh/williamwen42/358/orig 2025-12-04T09:21:59.9897288Z c9720b73dc4..0797e5771e1 gh/xmfan/312/base -> origin/gh/xmfan/312/base 2025-12-04T09:21:59.9898868Z 6c186d07af4..cd3e44304bd gh/xmfan/312/head -> origin/gh/xmfan/312/head 2025-12-04T09:21:59.9900427Z + c19eb1dfd87...ffc5889dc05 gh/xmfan/312/orig -> origin/gh/xmfan/312/orig (forced update) 2025-12-04T09:21:59.9902858Z * [new branch] gh/yang-yu-hang/3/base -> origin/gh/yang-yu-hang/3/base 2025-12-04T09:21:59.9904131Z * [new branch] gh/yang-yu-hang/3/head -> origin/gh/yang-yu-hang/3/head 2025-12-04T09:21:59.9905329Z * [new branch] gh/yang-yu-hang/3/orig -> origin/gh/yang-yu-hang/3/orig 2025-12-04T09:21:59.9909135Z 15c03b1ef56..8b955876eb5 gh/zpcore/26/base -> origin/gh/zpcore/26/base 2025-12-04T09:21:59.9910953Z 7c435a12e3c..b91bc7988a4 gh/zpcore/26/head -> origin/gh/zpcore/26/head 2025-12-04T09:21:59.9912455Z + 83bbc08a93e...449134738b0 gh/zpcore/26/orig -> origin/gh/zpcore/26/orig (forced update) 2025-12-04T09:21:59.9914642Z 66960a52b45..aeeebef8d53 gh/zpcore/28/base -> origin/gh/zpcore/28/base 2025-12-04T09:21:59.9916622Z 5b61e3dbc31..a2315c696b7 gh/zpcore/28/head -> origin/gh/zpcore/28/head 2025-12-04T09:21:59.9918141Z + a0aed5859db...eb3bf7263cc gh/zpcore/28/orig -> origin/gh/zpcore/28/orig (forced update) 2025-12-04T09:21:59.9920287Z + e812110fdd7...242d980a33e instrument-trunk-pull-linux-with-job-test-filters -> origin/instrument-trunk-pull-linux-with-job-test-filters (forced update) 2025-12-04T09:21:59.9922186Z + d9b114f50dc...2a005b9990b lucaskabela/typing_user_defined -> origin/lucaskabela/typing_user_defined (forced update) 2025-12-04T09:21:59.9923643Z 2bec68e73b6..7716da9fb23 main -> origin/main 2025-12-04T09:21:59.9925215Z * [new branch] malfet-patch-7 -> origin/malfet-patch-7 2025-12-04T09:21:59.9926863Z + 8c41d5d0214...c66fddf28a3 malfet/be-do-not-make-typos-in-build-artifacts -> origin/malfet/be-do-not-make-typos-in-build-artifacts (forced update) 2025-12-04T09:21:59.9928448Z + 173630c957d...c3514c10842 mlazos/bwd -> origin/mlazos/bwd (forced update) 2025-12-04T09:21:59.9936637Z * [new branch] mlazos/overguarding -> origin/mlazos/overguarding 2025-12-04T09:21:59.9936958Z * [new branch] ngimel/div_inv -> origin/ngimel/div_inv 2025-12-04T09:21:59.9937271Z ffdefc93b66..794e9585f39 nightly -> origin/nightly 2025-12-04T09:21:59.9937565Z * [new branch] norm_bench -> origin/norm_bench 2025-12-04T09:21:59.9937860Z * [new branch] oulgen-patch-4 -> origin/oulgen-patch-4 2025-12-04T09:21:59.9939131Z * [new branch] pianpwk/_debug_nn_module_compile -> origin/pianpwk/_debug_nn_module_compile 2025-12-04T09:21:59.9939520Z * [new branch] pianpwk/debug_mode_annotate -> origin/pianpwk/debug_mode_annotate 2025-12-04T09:21:59.9940537Z * [new branch] pianpwk/debug_mode_defaults -> origin/pianpwk/debug_mode_defaults 2025-12-04T09:21:59.9943490Z + bf4723344f4...fd4b2262ca8 some_rocm_inductor_skips -> origin/some_rocm_inductor_skips (forced update) 2025-12-04T09:21:59.9944855Z * [new branch] sy_annotation_in_autograd_hop -> origin/sy_annotation_in_autograd_hop 2025-12-04T09:21:59.9946209Z * [new branch] sy_export_annotation -> origin/sy_export_annotation 2025-12-04T09:21:59.9947514Z * [new branch] sy_original_dtensor -> origin/sy_original_dtensor 2025-12-04T09:21:59.9949804Z + ac073357b58...658580f64d1 tianren/customop_collectiveop_benchmark_fix -> origin/tianren/customop_collectiveop_benchmark_fix (forced update) 2025-12-04T09:21:59.9952294Z + f53e8e0ad67...599cdd0c9e2 tianren/dynamic_range_input_fix -> origin/tianren/dynamic_range_input_fix (forced update) 2025-12-04T09:21:59.9954150Z + f04191f3c49...0c45cf25250 update-triton-commit-hash/13663274526-1487-2 -> origin/update-triton-commit-hash/13663274526-1487-2 (forced update) 2025-12-04T09:21:59.9955665Z + 53f38bd17be...24981f9ef8d update-vllm-commit-hash/18451675449-1879-1 -> origin/update-vllm-commit-hash/18451675449-1879-1 (forced update) 2025-12-04T09:21:59.9957160Z + aa7c1f411f1...38f3bcae801 update-xla-commit-hash/19626841311-213-1 -> origin/update-xla-commit-hash/19626841311-213-1 (forced update) 2025-12-04T09:21:59.9959000Z 1174a507c35..a6c8d9a6d0a update_operator_readme -> origin/update_operator_readme 2025-12-04T09:21:59.9960719Z + 34f502be470...e5b6553e930 update_slow_tests_1763365283 -> origin/update_slow_tests_1763365283 (forced update) 2025-12-04T09:21:59.9962254Z + d621824e79d...c34efe4a025 update_submodule_FBGEMM -> origin/update_submodule_FBGEMM (forced update) 2025-12-04T09:21:59.9963930Z c6ae7579fe1..1d21b4df2ba viable/strict -> origin/viable/strict 2025-12-04T09:21:59.9965924Z + 054dfbd64b9...f9fb4d040af yiming/run_with_start_end_rng_hop -> origin/yiming/run_with_start_end_rng_hop (forced update) 2025-12-04T09:21:59.9969103Z t [tag update] ciflow/b200/115316 -> ciflow/b200/115316 2025-12-04T09:21:59.9969750Z * [new tag] ciflow/b200/161607 -> ciflow/b200/161607 2025-12-04T09:21:59.9970852Z t [tag update] ciflow/b200/167989 -> ciflow/b200/167989 2025-12-04T09:21:59.9971561Z * [new tag] ciflow/b200/168096 -> ciflow/b200/168096 2025-12-04T09:21:59.9972368Z * [new tag] ciflow/b200/168175 -> ciflow/b200/168175 2025-12-04T09:21:59.9973363Z t [tag update] ciflow/b200/168195 -> ciflow/b200/168195 2025-12-04T09:21:59.9974463Z t [tag update] ciflow/b200/169200 -> ciflow/b200/169200 2025-12-04T09:21:59.9975132Z * [new tag] ciflow/b200/169470 -> ciflow/b200/169470 2025-12-04T09:21:59.9975797Z * [new tag] ciflow/b200/169471 -> ciflow/b200/169471 2025-12-04T09:21:59.9976452Z * [new tag] ciflow/b200/169472 -> ciflow/b200/169472 2025-12-04T09:21:59.9977278Z * [new tag] ciflow/b200/169514 -> ciflow/b200/169514 2025-12-04T09:21:59.9977877Z * [new tag] ciflow/b200/169517 -> ciflow/b200/169517 2025-12-04T09:21:59.9978625Z * [new tag] ciflow/binaries/169510 -> ciflow/binaries/169510 2025-12-04T09:21:59.9979819Z t [tag update] ciflow/binaries_wheel/166829 -> ciflow/binaries_wheel/166829 2025-12-04T09:21:59.9980819Z t [tag update] ciflow/binaries_wheel/167972 -> ciflow/binaries_wheel/167972 2025-12-04T09:21:59.9981806Z * [new tag] ciflow/dynamo/167695 -> ciflow/dynamo/167695 2025-12-04T09:21:59.9982474Z * [new tag] ciflow/dynamo/168096 -> ciflow/dynamo/168096 2025-12-04T09:21:59.9983210Z * [new tag] ciflow/dynamo/169525 -> ciflow/dynamo/169525 2025-12-04T09:21:59.9984368Z t [tag update] ciflow/h100-symm-mem/167552 -> ciflow/h100-symm-mem/167552 2025-12-04T09:21:59.9985601Z t [tag update] ciflow/h100-symm-mem/169156 -> ciflow/h100-symm-mem/169156 2025-12-04T09:21:59.9986545Z t [tag update] ciflow/h100-symm-mem/169200 -> ciflow/h100-symm-mem/169200 2025-12-04T09:21:59.9987743Z t [tag update] ciflow/h100-symm-mem/169355 -> ciflow/h100-symm-mem/169355 2025-12-04T09:21:59.9988376Z * [new tag] ciflow/h100-symm-mem/169543 -> ciflow/h100-symm-mem/169543 2025-12-04T09:21:59.9989334Z t [tag update] ciflow/h100/115316 -> ciflow/h100/115316 2025-12-04T09:21:59.9990049Z * [new tag] ciflow/h100/161607 -> ciflow/h100/161607 2025-12-04T09:21:59.9991127Z t [tag update] ciflow/h100/167989 -> ciflow/h100/167989 2025-12-04T09:21:59.9991743Z * [new tag] ciflow/h100/168096 -> ciflow/h100/168096 2025-12-04T09:21:59.9992417Z * [new tag] ciflow/h100/168175 -> ciflow/h100/168175 2025-12-04T09:21:59.9993449Z t [tag update] ciflow/h100/168195 -> ciflow/h100/168195 2025-12-04T09:21:59.9994480Z t [tag update] ciflow/h100/169200 -> ciflow/h100/169200 2025-12-04T09:21:59.9995119Z * [new tag] ciflow/h100/169470 -> ciflow/h100/169470 2025-12-04T09:21:59.9995796Z * [new tag] ciflow/h100/169471 -> ciflow/h100/169471 2025-12-04T09:21:59.9996490Z * [new tag] ciflow/h100/169472 -> ciflow/h100/169472 2025-12-04T09:21:59.9997148Z * [new tag] ciflow/h100/169514 -> ciflow/h100/169514 2025-12-04T09:21:59.9998241Z * [new tag] ciflow/inductor-cu126/168096 -> ciflow/inductor-cu126/168096 2025-12-04T09:21:59.9999077Z * [new tag] ciflow/inductor-micro-benchmark-cpu-x86/168096 -> ciflow/inductor-micro-benchmark-cpu-x86/168096 2025-12-04T09:22:00.0000037Z t [tag update] ciflow/inductor-micro-benchmark/166165 -> ciflow/inductor-micro-benchmark/166165 2025-12-04T09:22:00.0000761Z * [new tag] ciflow/inductor-micro-benchmark/168096 -> ciflow/inductor-micro-benchmark/168096 2025-12-04T09:22:00.0001613Z * [new tag] ciflow/inductor-perf-compare/168096 -> ciflow/inductor-perf-compare/168096 2025-12-04T09:22:00.0002430Z * [new tag] ciflow/inductor-perf-test-nightly-rocm-mi300/168096 -> ciflow/inductor-perf-test-nightly-rocm-mi300/168096 2025-12-04T09:22:00.0003533Z t [tag update] ciflow/inductor-perf-test-nightly-rocm-mi300/169024 -> ciflow/inductor-perf-test-nightly-rocm-mi300/169024 2025-12-04T09:22:00.0004595Z t [tag update] ciflow/inductor-perf-test-nightly-rocm-mi355/169024 -> ciflow/inductor-perf-test-nightly-rocm-mi355/169024 2025-12-04T09:22:00.0005420Z * [new tag] ciflow/inductor-perf-test-nightly/168096 -> ciflow/inductor-perf-test-nightly/168096 2025-12-04T09:22:00.0006048Z * [new tag] ciflow/inductor-periodic/168096 -> ciflow/inductor-periodic/168096 2025-12-04T09:22:00.0007109Z t [tag update] ciflow/inductor-periodic/169024 -> ciflow/inductor-periodic/169024 2025-12-04T09:22:00.0008377Z t [tag update] ciflow/inductor-rocm-mi200/165997 -> ciflow/inductor-rocm-mi200/165997 2025-12-04T09:22:00.0008955Z * [new tag] ciflow/inductor-rocm-mi200/168096 -> ciflow/inductor-rocm-mi200/168096 2025-12-04T09:22:00.0009647Z * [new tag] ciflow/inductor-rocm-mi300/168096 -> ciflow/inductor-rocm-mi300/168096 2025-12-04T09:22:00.0010807Z t [tag update] ciflow/inductor-rocm/162052 -> ciflow/inductor-rocm/162052 2025-12-04T09:22:00.0011418Z * [new tag] ciflow/inductor-rocm/168971 -> ciflow/inductor-rocm/168971 2025-12-04T09:22:00.0012329Z * [new tag] ciflow/inductor-windows/168096 -> ciflow/inductor-windows/168096 2025-12-04T09:22:00.0013363Z t [tag update] ciflow/inductor/148492 -> ciflow/inductor/148492 2025-12-04T09:22:00.0014445Z t [tag update] ciflow/inductor/157149 -> ciflow/inductor/157149 2025-12-04T09:22:00.0015627Z t [tag update] ciflow/inductor/162052 -> ciflow/inductor/162052 2025-12-04T09:22:00.0016592Z t [tag update] ciflow/inductor/162275 -> ciflow/inductor/162275 2025-12-04T09:22:00.0017819Z t [tag update] ciflow/inductor/163245 -> ciflow/inductor/163245 2025-12-04T09:22:00.0018501Z * [new tag] ciflow/inductor/163335 -> ciflow/inductor/163335 2025-12-04T09:22:00.0019195Z * [new tag] ciflow/inductor/163503 -> ciflow/inductor/163503 2025-12-04T09:22:00.0020021Z * [new tag] ciflow/inductor/165270 -> ciflow/inductor/165270 2025-12-04T09:22:00.0021075Z t [tag update] ciflow/inductor/165274 -> ciflow/inductor/165274 2025-12-04T09:22:00.0021758Z * [new tag] ciflow/inductor/165322 -> ciflow/inductor/165322 2025-12-04T09:22:00.0022475Z * [new tag] ciflow/inductor/165597 -> ciflow/inductor/165597 2025-12-04T09:22:00.0023530Z t [tag update] ciflow/inductor/166063 -> ciflow/inductor/166063 2025-12-04T09:22:00.0024577Z t [tag update] ciflow/inductor/166165 -> ciflow/inductor/166165 2025-12-04T09:22:00.0025555Z t [tag update] ciflow/inductor/166483 -> ciflow/inductor/166483 2025-12-04T09:22:00.0026302Z * [new tag] ciflow/inductor/166545 -> ciflow/inductor/166545 2025-12-04T09:22:00.0027057Z * [new tag] ciflow/inductor/167300 -> ciflow/inductor/167300 2025-12-04T09:22:00.0028290Z t [tag update] ciflow/inductor/167552 -> ciflow/inductor/167552 2025-12-04T09:22:00.0029157Z t [tag update] ciflow/inductor/167647 -> ciflow/inductor/167647 2025-12-04T09:22:00.0030239Z t [tag update] ciflow/inductor/167677 -> ciflow/inductor/167677 2025-12-04T09:22:00.0031202Z t [tag update] ciflow/inductor/167680 -> ciflow/inductor/167680 2025-12-04T09:22:00.0031955Z * [new tag] ciflow/inductor/167695 -> ciflow/inductor/167695 2025-12-04T09:22:00.0033021Z t [tag update] ciflow/inductor/167742 -> ciflow/inductor/167742 2025-12-04T09:22:00.0033719Z * [new tag] ciflow/inductor/167768 -> ciflow/inductor/167768 2025-12-04T09:22:00.0034865Z t [tag update] ciflow/inductor/167781 -> ciflow/inductor/167781 2025-12-04T09:22:00.0035556Z * [new tag] ciflow/inductor/167887 -> ciflow/inductor/167887 2025-12-04T09:22:00.0036594Z t [tag update] ciflow/inductor/167972 -> ciflow/inductor/167972 2025-12-04T09:22:00.0037690Z t [tag update] ciflow/inductor/167989 -> ciflow/inductor/167989 2025-12-04T09:22:00.0038369Z * [new tag] ciflow/inductor/168002 -> ciflow/inductor/168002 2025-12-04T09:22:00.0039454Z t [tag update] ciflow/inductor/168050 -> ciflow/inductor/168050 2025-12-04T09:22:00.0040509Z t [tag update] ciflow/inductor/168051 -> ciflow/inductor/168051 2025-12-04T09:22:00.0041555Z t [tag update] ciflow/inductor/168052 -> ciflow/inductor/168052 2025-12-04T09:22:00.0042218Z * [new tag] ciflow/inductor/168096 -> ciflow/inductor/168096 2025-12-04T09:22:00.0043284Z t [tag update] ciflow/inductor/168114 -> ciflow/inductor/168114 2025-12-04T09:22:00.0044251Z t [tag update] ciflow/inductor/168115 -> ciflow/inductor/168115 2025-12-04T09:22:00.0045346Z t [tag update] ciflow/inductor/168127 -> ciflow/inductor/168127 2025-12-04T09:22:00.0046451Z t [tag update] ciflow/inductor/168157 -> ciflow/inductor/168157 2025-12-04T09:22:00.0047152Z * [new tag] ciflow/inductor/168175 -> ciflow/inductor/168175 2025-12-04T09:22:00.0048241Z t [tag update] ciflow/inductor/168185 -> ciflow/inductor/168185 2025-12-04T09:22:00.0049210Z t [tag update] ciflow/inductor/168195 -> ciflow/inductor/168195 2025-12-04T09:22:00.0050340Z t [tag update] ciflow/inductor/168266 -> ciflow/inductor/168266 2025-12-04T09:22:00.0051481Z t [tag update] ciflow/inductor/168326 -> ciflow/inductor/168326 2025-12-04T09:22:00.0052450Z t [tag update] ciflow/inductor/168368 -> ciflow/inductor/168368 2025-12-04T09:22:00.0053560Z t [tag update] ciflow/inductor/168894 -> ciflow/inductor/168894 2025-12-04T09:22:00.0054535Z t [tag update] ciflow/inductor/168939 -> ciflow/inductor/168939 2025-12-04T09:22:00.0055634Z t [tag update] ciflow/inductor/168946 -> ciflow/inductor/168946 2025-12-04T09:22:00.0056615Z t [tag update] ciflow/inductor/168950 -> ciflow/inductor/168950 2025-12-04T09:22:00.0057707Z t [tag update] ciflow/inductor/168951 -> ciflow/inductor/168951 2025-12-04T09:22:00.0058752Z t [tag update] ciflow/inductor/168952 -> ciflow/inductor/168952 2025-12-04T09:22:00.0059860Z t [tag update] ciflow/inductor/168955 -> ciflow/inductor/168955 2025-12-04T09:22:00.0060490Z * [new tag] ciflow/inductor/168971 -> ciflow/inductor/168971 2025-12-04T09:22:00.0061597Z t [tag update] ciflow/inductor/169006 -> ciflow/inductor/169006 2025-12-04T09:22:00.0062725Z t [tag update] ciflow/inductor/169024 -> ciflow/inductor/169024 2025-12-04T09:22:00.0063693Z t [tag update] ciflow/inductor/169025 -> ciflow/inductor/169025 2025-12-04T09:22:00.0064973Z t [tag update] ciflow/inductor/169066 -> ciflow/inductor/169066 2025-12-04T09:22:00.0065819Z t [tag update] ciflow/inductor/169102 -> ciflow/inductor/169102 2025-12-04T09:22:00.0066905Z t [tag update] ciflow/inductor/169103 -> ciflow/inductor/169103 2025-12-04T09:22:00.0067960Z t [tag update] ciflow/inductor/169121 -> ciflow/inductor/169121 2025-12-04T09:22:00.0069025Z t [tag update] ciflow/inductor/169141 -> ciflow/inductor/169141 2025-12-04T09:22:00.0070075Z t [tag update] ciflow/inductor/169151 -> ciflow/inductor/169151 2025-12-04T09:22:00.0071219Z t [tag update] ciflow/inductor/169200 -> ciflow/inductor/169200 2025-12-04T09:22:00.0071886Z * [new tag] ciflow/inductor/169204 -> ciflow/inductor/169204 2025-12-04T09:22:00.0073147Z t [tag update] ciflow/inductor/169230 -> ciflow/inductor/169230 2025-12-04T09:22:00.0074258Z t [tag update] ciflow/inductor/169260 -> ciflow/inductor/169260 2025-12-04T09:22:00.0075331Z t [tag update] ciflow/inductor/169282 -> ciflow/inductor/169282 2025-12-04T09:22:00.0076456Z t [tag update] ciflow/inductor/169319 -> ciflow/inductor/169319 2025-12-04T09:22:00.0077498Z t [tag update] ciflow/inductor/169326 -> ciflow/inductor/169326 2025-12-04T09:22:00.0078582Z t [tag update] ciflow/inductor/169340 -> ciflow/inductor/169340 2025-12-04T09:22:00.0079685Z t [tag update] ciflow/inductor/169341 -> ciflow/inductor/169341 2025-12-04T09:22:00.0080767Z t [tag update] ciflow/inductor/169355 -> ciflow/inductor/169355 2025-12-04T09:22:00.0081881Z t [tag update] ciflow/inductor/169393 -> ciflow/inductor/169393 2025-12-04T09:22:00.0082857Z t [tag update] ciflow/inductor/169415 -> ciflow/inductor/169415 2025-12-04T09:22:00.0084160Z t [tag update] ciflow/inductor/169430 -> ciflow/inductor/169430 2025-12-04T09:22:00.0084947Z * [new tag] ciflow/inductor/169437 -> ciflow/inductor/169437 2025-12-04T09:22:00.0086037Z t [tag update] ciflow/inductor/169438 -> ciflow/inductor/169438 2025-12-04T09:22:00.0087149Z t [tag update] ciflow/inductor/169441 -> ciflow/inductor/169441 2025-12-04T09:22:00.0088192Z t [tag update] ciflow/inductor/169446 -> ciflow/inductor/169446 2025-12-04T09:22:00.0089321Z t [tag update] ciflow/inductor/169447 -> ciflow/inductor/169447 2025-12-04T09:22:00.0090391Z t [tag update] ciflow/inductor/169452 -> ciflow/inductor/169452 2025-12-04T09:22:00.0091147Z * [new tag] ciflow/inductor/169455 -> ciflow/inductor/169455 2025-12-04T09:22:00.0091885Z * [new tag] ciflow/inductor/169459 -> ciflow/inductor/169459 2025-12-04T09:22:00.0092712Z * [new tag] ciflow/inductor/169463 -> ciflow/inductor/169463 2025-12-04T09:22:00.0093521Z * [new tag] ciflow/inductor/169476 -> ciflow/inductor/169476 2025-12-04T09:22:00.0094266Z * [new tag] ciflow/inductor/169485 -> ciflow/inductor/169485 2025-12-04T09:22:00.0094995Z * [new tag] ciflow/inductor/169493 -> ciflow/inductor/169493 2025-12-04T09:22:00.0095738Z * [new tag] ciflow/inductor/169496 -> ciflow/inductor/169496 2025-12-04T09:22:00.0096475Z * [new tag] ciflow/inductor/169497 -> ciflow/inductor/169497 2025-12-04T09:22:00.0097230Z * [new tag] ciflow/inductor/169503 -> ciflow/inductor/169503 2025-12-04T09:22:00.0097962Z * [new tag] ciflow/inductor/169504 -> ciflow/inductor/169504 2025-12-04T09:22:00.0098996Z * [new tag] ciflow/inductor/169505 -> ciflow/inductor/169505 2025-12-04T09:22:00.0100184Z * [new tag] ciflow/inductor/169508 -> ciflow/inductor/169508 2025-12-04T09:22:00.0100765Z * [new tag] ciflow/inductor/169509 -> ciflow/inductor/169509 2025-12-04T09:22:00.0101563Z * [new tag] ciflow/inductor/169513 -> ciflow/inductor/169513 2025-12-04T09:22:00.0102339Z * [new tag] ciflow/inductor/169514 -> ciflow/inductor/169514 2025-12-04T09:22:00.0103101Z * [new tag] ciflow/inductor/169515 -> ciflow/inductor/169515 2025-12-04T09:22:00.0103849Z * [new tag] ciflow/inductor/169517 -> ciflow/inductor/169517 2025-12-04T09:22:00.0104596Z * [new tag] ciflow/inductor/169519 -> ciflow/inductor/169519 2025-12-04T09:22:00.0105358Z * [new tag] ciflow/inductor/169520 -> ciflow/inductor/169520 2025-12-04T09:22:00.0106114Z * [new tag] ciflow/inductor/169521 -> ciflow/inductor/169521 2025-12-04T09:22:00.0106892Z * [new tag] ciflow/inductor/169524 -> ciflow/inductor/169524 2025-12-04T09:22:00.0107652Z * [new tag] ciflow/inductor/169527 -> ciflow/inductor/169527 2025-12-04T09:22:00.0108398Z * [new tag] ciflow/inductor/169528 -> ciflow/inductor/169528 2025-12-04T09:22:00.0109235Z * [new tag] ciflow/inductor/169532 -> ciflow/inductor/169532 2025-12-04T09:22:00.0110023Z * [new tag] ciflow/inductor/169535 -> ciflow/inductor/169535 2025-12-04T09:22:00.0110765Z * [new tag] ciflow/inductor/169536 -> ciflow/inductor/169536 2025-12-04T09:22:00.0111590Z * [new tag] ciflow/inductor/169547 -> ciflow/inductor/169547 2025-12-04T09:22:00.0112376Z * [new tag] ciflow/inductor/169548 -> ciflow/inductor/169548 2025-12-04T09:22:00.0113144Z * [new tag] ciflow/inductor/169549 -> ciflow/inductor/169549 2025-12-04T09:22:00.0113887Z * [new tag] ciflow/inductor/169551 -> ciflow/inductor/169551 2025-12-04T09:22:00.0114653Z * [new tag] ciflow/inductor/169552 -> ciflow/inductor/169552 2025-12-04T09:22:00.0115438Z * [new tag] ciflow/inductor/169553 -> ciflow/inductor/169553 2025-12-04T09:22:00.0116189Z * [new tag] ciflow/inductor/169557 -> ciflow/inductor/169557 2025-12-04T09:22:00.0117386Z t [tag update] ciflow/linux-aarch64/166876 -> ciflow/linux-aarch64/166876 2025-12-04T09:22:00.0118339Z t [tag update] ciflow/mps/169017 -> ciflow/mps/169017 2025-12-04T09:22:00.0119030Z * [new tag] ciflow/mps/169478 -> ciflow/mps/169478 2025-12-04T09:22:00.0119768Z * [new tag] ciflow/op-benchmark/169544 -> ciflow/op-benchmark/169544 2025-12-04T09:22:00.0120908Z t [tag update] ciflow/periodic-rocm-mi200/165997 -> ciflow/periodic-rocm-mi200/165997 2025-12-04T09:22:00.0121645Z * [new tag] ciflow/periodic/168096 -> ciflow/periodic/168096 2025-12-04T09:22:00.0122298Z * [new tag] ciflow/periodic/169286 -> ciflow/periodic/169286 2025-12-04T09:22:00.0123563Z * [new tag] ciflow/quantization-periodic/169207 -> ciflow/quantization-periodic/169207 2025-12-04T09:22:00.0124470Z t [tag update] ciflow/rocm-mi200/165997 -> ciflow/rocm-mi200/165997 2025-12-04T09:22:00.0125137Z * [new tag] ciflow/rocm-mi200/168096 -> ciflow/rocm-mi200/168096 2025-12-04T09:22:00.0125971Z * [new tag] ciflow/rocm-mi300/167157 -> ciflow/rocm-mi300/167157 2025-12-04T09:22:00.0126711Z * [new tag] ciflow/rocm-mi300/168096 -> ciflow/rocm-mi300/168096 2025-12-04T09:22:00.0127428Z * [new tag] ciflow/rocm-mi355/167157 -> ciflow/rocm-mi355/167157 2025-12-04T09:22:00.0128605Z t [tag update] ciflow/rocm/115316 -> ciflow/rocm/115316 2025-12-04T09:22:00.0129641Z t [tag update] ciflow/rocm/148492 -> ciflow/rocm/148492 2025-12-04T09:22:00.0130418Z * [new tag] ciflow/rocm/161607 -> ciflow/rocm/161607 2025-12-04T09:22:00.0131253Z t [tag update] ciflow/rocm/162052 -> ciflow/rocm/162052 2025-12-04T09:22:00.0132331Z t [tag update] ciflow/rocm/165997 -> ciflow/rocm/165997 2025-12-04T09:22:00.0133273Z t [tag update] ciflow/rocm/166165 -> ciflow/rocm/166165 2025-12-04T09:22:00.0134388Z t [tag update] ciflow/rocm/167781 -> ciflow/rocm/167781 2025-12-04T09:22:00.0135302Z t [tag update] ciflow/rocm/167989 -> ciflow/rocm/167989 2025-12-04T09:22:00.0136382Z t [tag update] ciflow/rocm/168195 -> ciflow/rocm/168195 2025-12-04T09:22:00.0137412Z t [tag update] ciflow/rocm/168939 -> ciflow/rocm/168939 2025-12-04T09:22:00.0138037Z * [new tag] ciflow/rocm/168971 -> ciflow/rocm/168971 2025-12-04T09:22:00.0139080Z t [tag update] ciflow/rocm/169024 -> ciflow/rocm/169024 2025-12-04T09:22:00.0140008Z t [tag update] ciflow/rocm/169200 -> ciflow/rocm/169200 2025-12-04T09:22:00.0141089Z t [tag update] ciflow/rocm/169312 -> ciflow/rocm/169312 2025-12-04T09:22:00.0141752Z * [new tag] ciflow/rocm/169455 -> ciflow/rocm/169455 2025-12-04T09:22:00.0142411Z * [new tag] ciflow/rocm/169470 -> ciflow/rocm/169470 2025-12-04T09:22:00.0143061Z * [new tag] ciflow/rocm/169471 -> ciflow/rocm/169471 2025-12-04T09:22:00.0143720Z * [new tag] ciflow/rocm/169472 -> ciflow/rocm/169472 2025-12-04T09:22:00.0144380Z * [new tag] ciflow/rocm/169514 -> ciflow/rocm/169514 2025-12-04T09:22:00.0145524Z t [tag update] ciflow/slow/168050 -> ciflow/slow/168050 2025-12-04T09:22:00.0146694Z * [new tag] ciflow/torchbench/168175 -> ciflow/torchbench/168175 2025-12-04T09:22:00.0147622Z t [tag update] ciflow/trunk/148492 -> ciflow/trunk/148492 2025-12-04T09:22:00.0148694Z t [tag update] ciflow/trunk/157149 -> ciflow/trunk/157149 2025-12-04T09:22:00.0149354Z * [new tag] ciflow/trunk/159718 -> ciflow/trunk/159718 2025-12-04T09:22:00.0150416Z t [tag update] ciflow/trunk/162275 -> ciflow/trunk/162275 2025-12-04T09:22:00.0151434Z t [tag update] ciflow/trunk/163245 -> ciflow/trunk/163245 2025-12-04T09:22:00.0152462Z t [tag update] ciflow/trunk/165274 -> ciflow/trunk/165274 2025-12-04T09:22:00.0153158Z * [new tag] ciflow/trunk/165728 -> ciflow/trunk/165728 2025-12-04T09:22:00.0154237Z t [tag update] ciflow/trunk/166165 -> ciflow/trunk/166165 2025-12-04T09:22:00.0154885Z * [new tag] ciflow/trunk/166829 -> ciflow/trunk/166829 2025-12-04T09:22:00.0155701Z * [new tag] ciflow/trunk/166843 -> ciflow/trunk/166843 2025-12-04T09:22:00.0156738Z t [tag update] ciflow/trunk/166876 -> ciflow/trunk/166876 2025-12-04T09:22:00.0157395Z * [new tag] ciflow/trunk/167552 -> ciflow/trunk/167552 2025-12-04T09:22:00.0158078Z * [new tag] ciflow/trunk/167555 -> ciflow/trunk/167555 2025-12-04T09:22:00.0159292Z t [tag update] ciflow/trunk/167672 -> ciflow/trunk/167672 2025-12-04T09:22:00.0159938Z * [new tag] ciflow/trunk/167742 -> ciflow/trunk/167742 2025-12-04T09:22:00.0160990Z t [tag update] ciflow/trunk/167781 -> ciflow/trunk/167781 2025-12-04T09:22:00.0161661Z * [new tag] ciflow/trunk/167887 -> ciflow/trunk/167887 2025-12-04T09:22:00.0162723Z t [tag update] ciflow/trunk/168050 -> ciflow/trunk/168050 2025-12-04T09:22:00.0163503Z * [new tag] ciflow/trunk/168051 -> ciflow/trunk/168051 2025-12-04T09:22:00.0164037Z * [new tag] ciflow/trunk/168096 -> ciflow/trunk/168096 2025-12-04T09:22:00.0165096Z t [tag update] ciflow/trunk/168127 -> ciflow/trunk/168127 2025-12-04T09:22:00.0166023Z t [tag update] ciflow/trunk/168157 -> ciflow/trunk/168157 2025-12-04T09:22:00.0168459Z * [new tag] ciflow/trunk/168175 -> ciflow/trunk/168175 2025-12-04T09:22:00.0169222Z * [new tag] ciflow/trunk/168226 -> ciflow/trunk/168226 2025-12-04T09:22:00.0169893Z * [new tag] ciflow/trunk/168262 -> ciflow/trunk/168262 2025-12-04T09:22:00.0170675Z * [new tag] ciflow/trunk/168328 -> ciflow/trunk/168328 2025-12-04T09:22:00.0171762Z t [tag update] ciflow/trunk/168368 -> ciflow/trunk/168368 2025-12-04T09:22:00.0172755Z t [tag update] ciflow/trunk/168955 -> ciflow/trunk/168955 2025-12-04T09:22:00.0173488Z * [new tag] ciflow/trunk/168980 -> ciflow/trunk/168980 2025-12-04T09:22:00.0174581Z t [tag update] ciflow/trunk/169006 -> ciflow/trunk/169006 2025-12-04T09:22:00.0175603Z t [tag update] ciflow/trunk/169025 -> ciflow/trunk/169025 2025-12-04T09:22:00.0176258Z * [new tag] ciflow/trunk/169048 -> ciflow/trunk/169048 2025-12-04T09:22:00.0177327Z t [tag update] ciflow/trunk/169066 -> ciflow/trunk/169066 2025-12-04T09:22:00.0178354Z t [tag update] ciflow/trunk/169102 -> ciflow/trunk/169102 2025-12-04T09:22:00.0179022Z * [new tag] ciflow/trunk/169103 -> ciflow/trunk/169103 2025-12-04T09:22:00.0179798Z * [new tag] ciflow/trunk/169125 -> ciflow/trunk/169125 2025-12-04T09:22:00.0180570Z * [new tag] ciflow/trunk/169139 -> ciflow/trunk/169139 2025-12-04T09:22:00.0181330Z * [new tag] ciflow/trunk/169148 -> ciflow/trunk/169148 2025-12-04T09:22:00.0182410Z t [tag update] ciflow/trunk/169151 -> ciflow/trunk/169151 2025-12-04T09:22:00.0183484Z t [tag update] ciflow/trunk/169156 -> ciflow/trunk/169156 2025-12-04T09:22:00.0184243Z * [new tag] ciflow/trunk/169176 -> ciflow/trunk/169176 2025-12-04T09:22:00.0184936Z * [new tag] ciflow/trunk/169204 -> ciflow/trunk/169204 2025-12-04T09:22:00.0185596Z * [new tag] ciflow/trunk/169207 -> ciflow/trunk/169207 2025-12-04T09:22:00.0186859Z t [tag update] ciflow/trunk/169231 -> ciflow/trunk/169231 2025-12-04T09:22:00.0187804Z t [tag update] ciflow/trunk/169260 -> ciflow/trunk/169260 2025-12-04T09:22:00.0188901Z t [tag update] ciflow/trunk/169280 -> ciflow/trunk/169280 2025-12-04T09:22:00.0189963Z t [tag update] ciflow/trunk/169296 -> ciflow/trunk/169296 2025-12-04T09:22:00.0190659Z * [new tag] ciflow/trunk/169312 -> ciflow/trunk/169312 2025-12-04T09:22:00.0191365Z * [new tag] ciflow/trunk/169343 -> ciflow/trunk/169343 2025-12-04T09:22:00.0192444Z t [tag update] ciflow/trunk/169355 -> ciflow/trunk/169355 2025-12-04T09:22:00.0193224Z * [new tag] ciflow/trunk/169379 -> ciflow/trunk/169379 2025-12-04T09:22:00.0194319Z t [tag update] ciflow/trunk/169385 -> ciflow/trunk/169385 2025-12-04T09:22:00.0194974Z * [new tag] ciflow/trunk/169387 -> ciflow/trunk/169387 2025-12-04T09:22:00.0195780Z * [new tag] ciflow/trunk/169410 -> ciflow/trunk/169410 2025-12-04T09:22:00.0196485Z * [new tag] ciflow/trunk/169423 -> ciflow/trunk/169423 2025-12-04T09:22:00.0197565Z t [tag update] ciflow/trunk/169430 -> ciflow/trunk/169430 2025-12-04T09:22:00.0198372Z * [new tag] ciflow/trunk/169437 -> ciflow/trunk/169437 2025-12-04T09:22:00.0198935Z * [new tag] ciflow/trunk/169442 -> ciflow/trunk/169442 2025-12-04T09:22:00.0199629Z * [new tag] ciflow/trunk/169452 -> ciflow/trunk/169452 2025-12-04T09:22:00.0200343Z * [new tag] ciflow/trunk/169454 -> ciflow/trunk/169454 2025-12-04T09:22:00.0201048Z * [new tag] ciflow/trunk/169459 -> ciflow/trunk/169459 2025-12-04T09:22:00.0201890Z * [new tag] ciflow/trunk/169474 -> ciflow/trunk/169474 2025-12-04T09:22:00.0202606Z * [new tag] ciflow/trunk/169475 -> ciflow/trunk/169475 2025-12-04T09:22:00.0203318Z * [new tag] ciflow/trunk/169476 -> ciflow/trunk/169476 2025-12-04T09:22:00.0204156Z * [new tag] ciflow/trunk/169487 -> ciflow/trunk/169487 2025-12-04T09:22:00.0204891Z * [new tag] ciflow/trunk/169497 -> ciflow/trunk/169497 2025-12-04T09:22:00.0205599Z * [new tag] ciflow/trunk/169503 -> ciflow/trunk/169503 2025-12-04T09:22:00.0206349Z * [new tag] ciflow/trunk/169505 -> ciflow/trunk/169505 2025-12-04T09:22:00.0207104Z * [new tag] ciflow/trunk/169507 -> ciflow/trunk/169507 2025-12-04T09:22:00.0207845Z * [new tag] ciflow/trunk/169514 -> ciflow/trunk/169514 2025-12-04T09:22:00.0208541Z * [new tag] ciflow/trunk/169517 -> ciflow/trunk/169517 2025-12-04T09:22:00.0209258Z * [new tag] ciflow/trunk/169519 -> ciflow/trunk/169519 2025-12-04T09:22:00.0209987Z * [new tag] ciflow/trunk/169528 -> ciflow/trunk/169528 2025-12-04T09:22:00.0210695Z * [new tag] ciflow/trunk/169541 -> ciflow/trunk/169541 2025-12-04T09:22:00.0211491Z * [new tag] ciflow/trunk/169555 -> ciflow/trunk/169555 2025-12-04T09:22:00.0212200Z * [new tag] ciflow/vllm/165270 -> ciflow/vllm/165270 2025-12-04T09:22:00.0213261Z t [tag update] ciflow/vllm/165274 -> ciflow/vllm/165274 2025-12-04T09:22:00.0213934Z * [new tag] ciflow/xpu/159718 -> ciflow/xpu/159718 2025-12-04T09:22:00.0215029Z t [tag update] ciflow/xpu/166829 -> ciflow/xpu/166829 2025-12-04T09:22:00.0215650Z * [new tag] ciflow/xpu/166843 -> ciflow/xpu/166843 2025-12-04T09:22:00.0216656Z t [tag update] ciflow/xpu/167972 -> ciflow/xpu/167972 2025-12-04T09:22:00.0217698Z t [tag update] ciflow/xpu/168262 -> ciflow/xpu/168262 2025-12-04T09:22:00.0218338Z * [new tag] ciflow/xpu/168328 -> ciflow/xpu/168328 2025-12-04T09:22:00.0219362Z t [tag update] ciflow/xpu/168950 -> ciflow/xpu/168950 2025-12-04T09:22:00.0220377Z t [tag update] ciflow/xpu/169200 -> ciflow/xpu/169200 2025-12-04T09:22:00.0221389Z t [tag update] ciflow/xpu/169203 -> ciflow/xpu/169203 2025-12-04T09:22:00.0222378Z t [tag update] ciflow/xpu/169230 -> ciflow/xpu/169230 2025-12-04T09:22:00.0223368Z t [tag update] ciflow/xpu/169231 -> ciflow/xpu/169231 2025-12-04T09:22:00.0224372Z t [tag update] ciflow/xpu/169280 -> ciflow/xpu/169280 2025-12-04T09:22:00.0225383Z t [tag update] ciflow/xpu/169296 -> ciflow/xpu/169296 2025-12-04T09:22:00.0226034Z * [new tag] ciflow/xpu/169410 -> ciflow/xpu/169410 2025-12-04T09:22:00.0226712Z * [new tag] ciflow/xpu/169442 -> ciflow/xpu/169442 2025-12-04T09:22:00.0227418Z * [new tag] ciflow/xpu/169555 -> ciflow/xpu/169555 2025-12-04T09:22:00.0228851Z * [new tag] trunk/02d8bd6974cf84b721680d773dbdb1b6f40ce272 -> trunk/02d8bd6974cf84b721680d773dbdb1b6f40ce272 2025-12-04T09:22:00.0229445Z * [new tag] trunk/07dcc0b83db3211653a38565a24e15acdba75654 -> trunk/07dcc0b83db3211653a38565a24e15acdba75654 2025-12-04T09:22:00.0230191Z * [new tag] trunk/09076941a95c76f4d9ad189d064dfd8baa39e672 -> trunk/09076941a95c76f4d9ad189d064dfd8baa39e672 2025-12-04T09:22:00.0230972Z * [new tag] trunk/0bbbdf1750567a980634ad907a325357ba8ba8f2 -> trunk/0bbbdf1750567a980634ad907a325357ba8ba8f2 2025-12-04T09:22:00.0231761Z * [new tag] trunk/18f3ca08f13b8de61307f5e8cd7d4cccb67e9d11 -> trunk/18f3ca08f13b8de61307f5e8cd7d4cccb67e9d11 2025-12-04T09:22:00.0232489Z * [new tag] trunk/195f92e98d3d66738577f11f22c4b5c8a1c76dd5 -> trunk/195f92e98d3d66738577f11f22c4b5c8a1c76dd5 2025-12-04T09:22:00.0233235Z * [new tag] trunk/1afe2832f58e24e54a5bfda5a5afa9b96fdea40e -> trunk/1afe2832f58e24e54a5bfda5a5afa9b96fdea40e 2025-12-04T09:22:00.0233969Z * [new tag] trunk/1c87554d74140eaee964ca8b1832cede67f5f520 -> trunk/1c87554d74140eaee964ca8b1832cede67f5f520 2025-12-04T09:22:00.0234724Z * [new tag] trunk/1d21b4df2babe322e5d085ceb6de884eb260a62d -> trunk/1d21b4df2babe322e5d085ceb6de884eb260a62d 2025-12-04T09:22:00.0235520Z * [new tag] trunk/201e2c4117eb9744594dad6a5c18213d7b4705d7 -> trunk/201e2c4117eb9744594dad6a5c18213d7b4705d7 2025-12-04T09:22:00.0236246Z * [new tag] trunk/2353a0f60eb4b4cb6675907a7fa9fbedc1c02e7f -> trunk/2353a0f60eb4b4cb6675907a7fa9fbedc1c02e7f 2025-12-04T09:22:00.0236976Z * [new tag] trunk/2887faaec6295d081580d09fce161201826c6d87 -> trunk/2887faaec6295d081580d09fce161201826c6d87 2025-12-04T09:22:00.0237708Z * [new tag] trunk/296e67c92635443c67b11c0ae1bd045f03ebb7bc -> trunk/296e67c92635443c67b11c0ae1bd045f03ebb7bc 2025-12-04T09:22:00.0238472Z * [new tag] trunk/2ac3ef882afb23136adc188975f0a8802fc68adf -> trunk/2ac3ef882afb23136adc188975f0a8802fc68adf 2025-12-04T09:22:00.0239242Z * [new tag] trunk/2df6058f116a65722a0e03073402feb242572d35 -> trunk/2df6058f116a65722a0e03073402feb242572d35 2025-12-04T09:22:00.0239959Z * [new tag] trunk/2e0c2e170fe658c440775c8e5c44228aafcc47ec -> trunk/2e0c2e170fe658c440775c8e5c44228aafcc47ec 2025-12-04T09:22:00.0240758Z * [new tag] trunk/305168768a95d69c444df5cd334bb774edfe06f1 -> trunk/305168768a95d69c444df5cd334bb774edfe06f1 2025-12-04T09:22:00.0241511Z * [new tag] trunk/320de0c6b0a3e7c6d2693ea5c28d5d0156ba7991 -> trunk/320de0c6b0a3e7c6d2693ea5c28d5d0156ba7991 2025-12-04T09:22:00.0242244Z * [new tag] trunk/34a98608afa0cb5b48f0d6d30432fdd0a2614ddf -> trunk/34a98608afa0cb5b48f0d6d30432fdd0a2614ddf 2025-12-04T09:22:00.0242977Z * [new tag] trunk/35b7a9a26c5923d98aebaa41a031dae21788a9ee -> trunk/35b7a9a26c5923d98aebaa41a031dae21788a9ee 2025-12-04T09:22:00.0243705Z * [new tag] trunk/39d07dbf03a911bdd45d1af78d8638dc92074938 -> trunk/39d07dbf03a911bdd45d1af78d8638dc92074938 2025-12-04T09:22:00.0244450Z * [new tag] trunk/3d35fd20a78ff4d016fa80f4e5fad37191d7bcae -> trunk/3d35fd20a78ff4d016fa80f4e5fad37191d7bcae 2025-12-04T09:22:00.0245204Z * [new tag] trunk/43b94713bbf340d3c124fde02d0f73add4021247 -> trunk/43b94713bbf340d3c124fde02d0f73add4021247 2025-12-04T09:22:00.0246009Z * [new tag] trunk/47b28ddf7bd74b50fa93b307a7d3b183a6d77f54 -> trunk/47b28ddf7bd74b50fa93b307a7d3b183a6d77f54 2025-12-04T09:22:00.0246862Z * [new tag] trunk/49a04d26088acc17d948ddd66920f3e16371e873 -> trunk/49a04d26088acc17d948ddd66920f3e16371e873 2025-12-04T09:22:00.0247659Z * [new tag] trunk/4fefb8e7e942386ffac764a41b232241f82bea3a -> trunk/4fefb8e7e942386ffac764a41b232241f82bea3a 2025-12-04T09:22:00.0248419Z * [new tag] trunk/5191b2fa68ba19960912bfd7fd721c79d76bb1f3 -> trunk/5191b2fa68ba19960912bfd7fd721c79d76bb1f3 2025-12-04T09:22:00.0249273Z * [new tag] trunk/52ac0f0dc4acacd219f1317fbc28ec631c01e07a -> trunk/52ac0f0dc4acacd219f1317fbc28ec631c01e07a 2025-12-04T09:22:00.0249988Z * [new tag] trunk/5634469fda9e5d98869c82c7d03bb08914245f96 -> trunk/5634469fda9e5d98869c82c7d03bb08914245f96 2025-12-04T09:22:00.0250744Z * [new tag] trunk/597930f6b568852356ca9795dac76f9e4653adbd -> trunk/597930f6b568852356ca9795dac76f9e4653adbd 2025-12-04T09:22:00.0251501Z * [new tag] trunk/597df3a4e2a67b9fdbe1a89b2f4d74f822274db6 -> trunk/597df3a4e2a67b9fdbe1a89b2f4d74f822274db6 2025-12-04T09:22:00.0252259Z * [new tag] trunk/5bf1cdf4755c54ef462b44cb8041b0a57311556b -> trunk/5bf1cdf4755c54ef462b44cb8041b0a57311556b 2025-12-04T09:22:00.0253025Z * [new tag] trunk/5f21d27e71268464d362a96c9ac09ea475f7f202 -> trunk/5f21d27e71268464d362a96c9ac09ea475f7f202 2025-12-04T09:22:00.0253773Z * [new tag] trunk/61be54a31dc09b59d99b62176fb935aee0b924ef -> trunk/61be54a31dc09b59d99b62176fb935aee0b924ef 2025-12-04T09:22:00.0254527Z * [new tag] trunk/65c4620d6bb0c6029f69762c22b91dda2294da9a -> trunk/65c4620d6bb0c6029f69762c22b91dda2294da9a 2025-12-04T09:22:00.0255321Z * [new tag] trunk/6c8b6a043f1628188b6396b3a2a6e000ca68362b -> trunk/6c8b6a043f1628188b6396b3a2a6e000ca68362b 2025-12-04T09:22:00.0256409Z * [new tag] trunk/6ceb4a32f92ae67ce5d7d97931d17401ebf5ffa5 -> trunk/6ceb4a32f92ae67ce5d7d97931d17401ebf5ffa5 2025-12-04T09:22:00.0257151Z * [new tag] trunk/6e404e9b7d6f5fb0de86aa73888c3038248c17f8 -> trunk/6e404e9b7d6f5fb0de86aa73888c3038248c17f8 2025-12-04T09:22:00.0257903Z * [new tag] trunk/6f53fefeb90ad3281119b5cfc4aa9ffd8a066e3d -> trunk/6f53fefeb90ad3281119b5cfc4aa9ffd8a066e3d 2025-12-04T09:22:00.0258585Z * [new tag] trunk/6ff831180d2fa436c7f1c1af3adac641fce9d60e -> trunk/6ff831180d2fa436c7f1c1af3adac641fce9d60e 2025-12-04T09:22:00.0259307Z * [new tag] trunk/7348cb355ff0a6f79cd4871215aea72185748734 -> trunk/7348cb355ff0a6f79cd4871215aea72185748734 2025-12-04T09:22:00.0260102Z * [new tag] trunk/74fe26a1ebe32931783569f2e762e3c2c974901f -> trunk/74fe26a1ebe32931783569f2e762e3c2c974901f 2025-12-04T09:22:00.0260811Z * [new tag] trunk/76aeb8c7e0f795b3fddca134cbea9a69da3ee696 -> trunk/76aeb8c7e0f795b3fddca134cbea9a69da3ee696 2025-12-04T09:22:00.0261896Z * [new tag] trunk/7716da9fb23f27a65b41f9f016a2afadf281c18f -> trunk/7716da9fb23f27a65b41f9f016a2afadf281c18f 2025-12-04T09:22:00.0262538Z * [new tag] trunk/78adb3b3df41b45d2368b67226d2f864b78939a6 -> trunk/78adb3b3df41b45d2368b67226d2f864b78939a6 2025-12-04T09:22:00.0263291Z * [new tag] trunk/7b7af390ea8541c611d1ce2018a6934188fc197b -> trunk/7b7af390ea8541c611d1ce2018a6934188fc197b 2025-12-04T09:22:00.0264002Z * [new tag] trunk/7ba4680f3755a560af81aa0f688791e367aa3609 -> trunk/7ba4680f3755a560af81aa0f688791e367aa3609 2025-12-04T09:22:00.0264736Z * [new tag] trunk/7cbc2d034cecd21ab5c9707d0a9c525c17143fb8 -> trunk/7cbc2d034cecd21ab5c9707d0a9c525c17143fb8 2025-12-04T09:22:00.0265468Z * [new tag] trunk/7eb625920054b1126a7d2d99818aaa188c6ba95e -> trunk/7eb625920054b1126a7d2d99818aaa188c6ba95e 2025-12-04T09:22:00.0266205Z * [new tag] trunk/85a315917efe82c24306be805c584ec044951c75 -> trunk/85a315917efe82c24306be805c584ec044951c75 2025-12-04T09:22:00.0266908Z * [new tag] trunk/87329491c82a5f8c1cc4ec11d8f55a5de2551ece -> trunk/87329491c82a5f8c1cc4ec11d8f55a5de2551ece 2025-12-04T09:22:00.0267649Z * [new tag] trunk/89e3bbcb5b5321dc8b9520b4d5a8ee60cea1d0b4 -> trunk/89e3bbcb5b5321dc8b9520b4d5a8ee60cea1d0b4 2025-12-04T09:22:00.0268334Z * [new tag] trunk/8c73bbbb02159223c0c97d268a0a74cb78158a1c -> trunk/8c73bbbb02159223c0c97d268a0a74cb78158a1c 2025-12-04T09:22:00.0269153Z * [new tag] trunk/8d56e98c8db988a22cb2dfaeefb30bc7d2a3cc43 -> trunk/8d56e98c8db988a22cb2dfaeefb30bc7d2a3cc43 2025-12-04T09:22:00.0269894Z * [new tag] trunk/9b3e34d8589b29f7b4e7fab6f78711b7ca6e4639 -> trunk/9b3e34d8589b29f7b4e7fab6f78711b7ca6e4639 2025-12-04T09:22:00.0270612Z * [new tag] trunk/9f0df5686cb4ada94f94620acba2e3c3f363b11d -> trunk/9f0df5686cb4ada94f94620acba2e3c3f363b11d 2025-12-04T09:22:00.0271394Z * [new tag] trunk/a15066c28b3145e6edbfc88359d0411d14cfc70c -> trunk/a15066c28b3145e6edbfc88359d0411d14cfc70c 2025-12-04T09:22:00.0272145Z * [new tag] trunk/abfa1a6d65c7c159e35c72c25979b9da4971689e -> trunk/abfa1a6d65c7c159e35c72c25979b9da4971689e 2025-12-04T09:22:00.0272838Z * [new tag] trunk/ae3a2395bf66151078e2d201716f7d63ce1c6f3e -> trunk/ae3a2395bf66151078e2d201716f7d63ce1c6f3e 2025-12-04T09:22:00.0273572Z * [new tag] trunk/b1decff555cd50e2123c8c6e25cc0d447c411f62 -> trunk/b1decff555cd50e2123c8c6e25cc0d447c411f62 2025-12-04T09:22:00.0274271Z * [new tag] trunk/b2b6b034c9fd08672c40e63ef243556ad4c49bd2 -> trunk/b2b6b034c9fd08672c40e63ef243556ad4c49bd2 2025-12-04T09:22:00.0275000Z * [new tag] trunk/b3a7edb2311367974cc7cd764cfb11a5d6758b24 -> trunk/b3a7edb2311367974cc7cd764cfb11a5d6758b24 2025-12-04T09:22:00.0275709Z * [new tag] trunk/b6b6c80379388b7f9932c3e6a0f9907bf430e417 -> trunk/b6b6c80379388b7f9932c3e6a0f9907bf430e417 2025-12-04T09:22:00.0276395Z * [new tag] trunk/b6b6d912df0b6f4082f8e50b18bd1de1dd7325f4 -> trunk/b6b6d912df0b6f4082f8e50b18bd1de1dd7325f4 2025-12-04T09:22:00.0277208Z * [new tag] trunk/ba1412546f3082c0958c077acc2025e4dbc33f1f -> trunk/ba1412546f3082c0958c077acc2025e4dbc33f1f 2025-12-04T09:22:00.0277939Z * [new tag] trunk/bac403c0b38c63bdbcc0c31f1c2b0bc0260f610f -> trunk/bac403c0b38c63bdbcc0c31f1c2b0bc0260f610f 2025-12-04T09:22:00.0278644Z * [new tag] trunk/bc39b2b3bc7a6e19a42e62bd576974035086fe55 -> trunk/bc39b2b3bc7a6e19a42e62bd576974035086fe55 2025-12-04T09:22:00.0279362Z * [new tag] trunk/bc43d5b297f207a11d83d77ddf0152bdaabe15a8 -> trunk/bc43d5b297f207a11d83d77ddf0152bdaabe15a8 2025-12-04T09:22:00.0280061Z * [new tag] trunk/bc6a4863c7246a6493d16d4ea6eee71ec07c6a09 -> trunk/bc6a4863c7246a6493d16d4ea6eee71ec07c6a09 2025-12-04T09:22:00.0280772Z * [new tag] trunk/bea4912944defdbcb8b061800caab6cbbbd01df5 -> trunk/bea4912944defdbcb8b061800caab6cbbbd01df5 2025-12-04T09:22:00.0281523Z * [new tag] trunk/c55b1e8f61d041ee436d697449eb028931d574fb -> trunk/c55b1e8f61d041ee436d697449eb028931d574fb 2025-12-04T09:22:00.0282234Z * [new tag] trunk/cc0853af42122f8185321f542616f4474e717f09 -> trunk/cc0853af42122f8185321f542616f4474e717f09 2025-12-04T09:22:00.0282965Z * [new tag] trunk/d038b0130ec7c20ebcac219301292fd8e98a1ace -> trunk/d038b0130ec7c20ebcac219301292fd8e98a1ace 2025-12-04T09:22:00.0283698Z * [new tag] trunk/d16447dacaf2420ea175f0c275c75da951f57d39 -> trunk/d16447dacaf2420ea175f0c275c75da951f57d39 2025-12-04T09:22:00.0284413Z * [new tag] trunk/d40f4950f2b7f7aa380a22fe0f6166e71680fbcf -> trunk/d40f4950f2b7f7aa380a22fe0f6166e71680fbcf 2025-12-04T09:22:00.0285134Z * [new tag] trunk/d54ff42903c2ae0533931ff11d23b35f875bdb3d -> trunk/d54ff42903c2ae0533931ff11d23b35f875bdb3d 2025-12-04T09:22:00.0285852Z * [new tag] trunk/d78f52b199c547106d4cd9d2856dd0805c118bf1 -> trunk/d78f52b199c547106d4cd9d2856dd0805c118bf1 2025-12-04T09:22:00.0288251Z * [new tag] trunk/d900f5e86745dec76713f4b0ef07005ef36b2f5a -> trunk/d900f5e86745dec76713f4b0ef07005ef36b2f5a 2025-12-04T09:22:00.0289030Z * [new tag] trunk/dd18a75336a4fbd7497955cc5665904724fce889 -> trunk/dd18a75336a4fbd7497955cc5665904724fce889 2025-12-04T09:22:00.0289790Z * [new tag] trunk/dfbd3714d15c37a7b83b322a6b60f997fc00f50c -> trunk/dfbd3714d15c37a7b83b322a6b60f997fc00f50c 2025-12-04T09:22:00.0290598Z * [new tag] trunk/e115f9f4e4b039f8e9a642aaa2bd8254a920541b -> trunk/e115f9f4e4b039f8e9a642aaa2bd8254a920541b 2025-12-04T09:22:00.0291361Z * [new tag] trunk/e3f24fd73ad74c6e7176687986436956c7c18235 -> trunk/e3f24fd73ad74c6e7176687986436956c7c18235 2025-12-04T09:22:00.0291895Z * [new tag] trunk/e7d24d3ff93d1503ba63860b7057438ad93f918e -> trunk/e7d24d3ff93d1503ba63860b7057438ad93f918e 2025-12-04T09:22:00.0292575Z * [new tag] trunk/ea7035f462a0d2830865ee86c832bd101e1427fc -> trunk/ea7035f462a0d2830865ee86c832bd101e1427fc 2025-12-04T09:22:00.0293272Z * [new tag] trunk/eabb7ad2128580ef674446027b95bcf4e21e8df3 -> trunk/eabb7ad2128580ef674446027b95bcf4e21e8df3 2025-12-04T09:22:00.0293992Z * [new tag] trunk/eb5c63652a33da42e7018c23df5f20a3eb4c6ccf -> trunk/eb5c63652a33da42e7018c23df5f20a3eb4c6ccf 2025-12-04T09:22:00.0294708Z * [new tag] trunk/ec2c71f5c85021b8938cdafadce24c15a36fd93e -> trunk/ec2c71f5c85021b8938cdafadce24c15a36fd93e 2025-12-04T09:22:00.0295421Z * [new tag] trunk/ecbcc3f6bf327856b435b259ac63cc2f328c4b4e -> trunk/ecbcc3f6bf327856b435b259ac63cc2f328c4b4e 2025-12-04T09:22:00.0296123Z * [new tag] trunk/ee87bbe876c42575e961b32a0827d76bc9782ca2 -> trunk/ee87bbe876c42575e961b32a0827d76bc9782ca2 2025-12-04T09:22:00.0296946Z * [new tag] trunk/f7e1bd80a063e17453c361837ba6ea2570920a73 -> trunk/f7e1bd80a063e17453c361837ba6ea2570920a73 2025-12-04T09:22:00.0297831Z * [new tag] trunk/f9bd6c53624c7c0ea3772de78498326e84c2f0e7 -> trunk/f9bd6c53624c7c0ea3772de78498326e84c2f0e7 2025-12-04T09:22:00.0298399Z * [new tag] trunk/fdf863d5e1de3b2688c9511e96876e34581dbfd7 -> trunk/fdf863d5e1de3b2688c9511e96876e34581dbfd7 2025-12-04T09:22:00.0299099Z * [new tag] trunk/fe0e65adfc0e7ca6e5f57e6ea8b16bd5cc967307 -> trunk/fe0e65adfc0e7ca6e5f57e6ea8b16bd5cc967307 2025-12-04T09:22:00.0299835Z * [new tag] trunk/ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 -> trunk/ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:22:00.0309885Z * [new tag] viable/strict/1764793566 -> viable/strict/1764793566 2025-12-04T09:22:00.0310763Z * [new tag] viable/strict/1764797093 -> viable/strict/1764797093 2025-12-04T09:22:00.0311618Z * [new tag] viable/strict/1764800729 -> viable/strict/1764800729 2025-12-04T09:22:00.1054878Z [command]/usr/bin/git rev-parse --verify --quiet ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32^{object} 2025-12-04T09:22:00.1095240Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:22:00.1101772Z ##[endgroup] 2025-12-04T09:22:00.1102368Z ##[group]Determining the checkout info 2025-12-04T09:22:00.1103013Z ##[endgroup] 2025-12-04T09:22:00.1107605Z [command]/usr/bin/git sparse-checkout disable 2025-12-04T09:22:00.1269647Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-12-04T09:22:00.1302058Z ##[group]Checking out the ref 2025-12-04T09:22:00.1307421Z [command]/usr/bin/git checkout --progress --force ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:22:00.3407204Z Previous HEAD position was 135f3753c41 [xpu][feature] [3/3] Register the `scaled_mm` and `scaled_mm_v2` for xpu (#166056) 2025-12-04T09:22:00.3412849Z HEAD is now at ffd9b0fb435 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T09:22:00.3470796Z ##[endgroup] 2025-12-04T09:22:00.3471391Z ##[group]Setting up auth for fetching submodules 2025-12-04T09:22:00.3476259Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T09:22:00.3519137Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-12-04T09:22:00.3566405Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-12-04T09:22:00.3589210Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-12-04T09:22:00.3608126Z ##[endgroup] 2025-12-04T09:22:00.3608821Z ##[group]Fetching submodules 2025-12-04T09:22:00.3609695Z [command]/usr/bin/git submodule sync --recursive 2025-12-04T09:22:00.3859611Z Synchronizing submodule url for 'android/libs/fbjni' 2025-12-04T09:22:00.3904257Z Synchronizing submodule url for 'third_party/FP16' 2025-12-04T09:22:00.3946741Z Synchronizing submodule url for 'third_party/FXdiv' 2025-12-04T09:22:00.3985073Z Synchronizing submodule url for 'third_party/NNPACK' 2025-12-04T09:22:00.4027198Z Synchronizing submodule url for 'third_party/NVTX' 2025-12-04T09:22:00.4066779Z Synchronizing submodule url for 'third_party/VulkanMemoryAllocator' 2025-12-04T09:22:00.4108066Z Synchronizing submodule url for 'third_party/XNNPACK' 2025-12-04T09:22:00.4156016Z Synchronizing submodule url for 'third_party/aiter' 2025-12-04T09:22:00.4194206Z Synchronizing submodule url for 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:22:00.4250821Z Synchronizing submodule url for 'third_party/benchmark' 2025-12-04T09:22:00.4297881Z Synchronizing submodule url for 'third_party/composable_kernel' 2025-12-04T09:22:00.4346513Z Synchronizing submodule url for 'third_party/cpp-httplib' 2025-12-04T09:22:00.4385675Z Synchronizing submodule url for 'third_party/cpuinfo' 2025-12-04T09:22:00.4429831Z Synchronizing submodule url for 'third_party/cudnn_frontend' 2025-12-04T09:22:00.4471850Z Synchronizing submodule url for 'third_party/cutlass' 2025-12-04T09:22:00.4514737Z Synchronizing submodule url for 'third_party/fbgemm' 2025-12-04T09:22:00.4558875Z Synchronizing submodule url for 'third_party/fbgemm/external/asmjit' 2025-12-04T09:22:00.4595163Z Synchronizing submodule url for 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:22:00.4645932Z Synchronizing submodule url for 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:22:00.4675343Z Synchronizing submodule url for 'third_party/fbgemm/external/cutlass' 2025-12-04T09:22:00.4719557Z Synchronizing submodule url for 'third_party/fbgemm/external/googletest' 2025-12-04T09:22:00.4768818Z Synchronizing submodule url for 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:22:00.4802051Z Synchronizing submodule url for 'third_party/fbgemm/external/json' 2025-12-04T09:22:00.4853302Z Synchronizing submodule url for 'third_party/flash-attention' 2025-12-04T09:22:00.4885064Z Synchronizing submodule url for 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:22:00.4930800Z Synchronizing submodule url for 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:22:00.4979588Z Synchronizing submodule url for 'third_party/flatbuffers' 2025-12-04T09:22:00.5022418Z Synchronizing submodule url for 'third_party/fmt' 2025-12-04T09:22:00.5068952Z Synchronizing submodule url for 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:22:00.5119301Z Synchronizing submodule url for 'third_party/gloo' 2025-12-04T09:22:00.5157905Z Synchronizing submodule url for 'third_party/googletest' 2025-12-04T09:22:00.5202853Z Synchronizing submodule url for 'third_party/ideep' 2025-12-04T09:22:00.5243539Z Synchronizing submodule url for 'third_party/ideep/mkl-dnn' 2025-12-04T09:22:00.5295700Z Synchronizing submodule url for 'third_party/ittapi' 2025-12-04T09:22:00.5341452Z Synchronizing submodule url for 'third_party/kineto' 2025-12-04T09:22:00.5372987Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:22:00.5424160Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:22:00.5464592Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:22:00.5506624Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:22:00.5547384Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:22:00.5585643Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:22:00.5630564Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:22:00.5669965Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:22:00.5706557Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:22:00.5750427Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:22:00.5787769Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:22:00.5826109Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:00.5866305Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:00.5916535Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:22:00.5950133Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:22:00.5990027Z Synchronizing submodule url for 'third_party/kleidiai' 2025-12-04T09:22:00.6034137Z Synchronizing submodule url for 'third_party/mimalloc' 2025-12-04T09:22:00.6084608Z Synchronizing submodule url for 'third_party/nlohmann' 2025-12-04T09:22:00.6132268Z Synchronizing submodule url for 'third_party/onnx' 2025-12-04T09:22:00.6185747Z Synchronizing submodule url for 'third_party/onnx/third_party/pybind11' 2025-12-04T09:22:00.6251540Z Synchronizing submodule url for 'third_party/opentelemetry-cpp' 2025-12-04T09:22:00.6292584Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:22:00.6332161Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:22:00.6373044Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:22:00.6404015Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:22:00.6439067Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:22:00.6477636Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:22:00.6515118Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:22:00.6549806Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:00.6588441Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:00.6630228Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:22:00.6691728Z Synchronizing submodule url for 'third_party/pocketfft' 2025-12-04T09:22:00.6732688Z Synchronizing submodule url for 'third_party/protobuf' 2025-12-04T09:22:00.6767066Z Synchronizing submodule url for 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:22:00.6805238Z Synchronizing submodule url for 'third_party/protobuf/third_party/googletest' 2025-12-04T09:22:00.6855746Z Synchronizing submodule url for 'third_party/psimd' 2025-12-04T09:22:00.6890479Z Synchronizing submodule url for 'third_party/pthreadpool' 2025-12-04T09:22:00.6930354Z Synchronizing submodule url for 'third_party/pybind11' 2025-12-04T09:22:00.6972941Z Synchronizing submodule url for 'third_party/python-peachpy' 2025-12-04T09:22:00.7009898Z Synchronizing submodule url for 'third_party/sleef' 2025-12-04T09:22:00.7050311Z Synchronizing submodule url for 'third_party/tensorpipe' 2025-12-04T09:22:00.7080204Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:22:00.7117582Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:22:00.7158844Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:22:00.7198966Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:22:00.7226149Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:22:00.7295158Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --recursive 2025-12-04T09:22:00.7771135Z Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2025-12-04T09:22:00.8001999Z Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2025-12-04T09:22:00.8258122Z Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2025-12-04T09:22:00.8504280Z Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2025-12-04T09:22:00.8764986Z Submodule path 'third_party/NVTX': checked out '3ebbc93ded7285963bff932c678fa367eb393ba6' 2025-12-04T09:22:00.9004531Z Submodule path 'third_party/VulkanMemoryAllocator': checked out '1d8f600fd424278486eade7ed3e877c99f0846b1' 2025-12-04T09:22:00.9418741Z Submodule path 'third_party/XNNPACK': checked out '51a0103656eff6fc9bfd39a4597923c4b542c883' 2025-12-04T09:22:00.9811778Z Submodule path 'third_party/aiter': checked out '01aae101b9e5e94d6c16a9514c9fb8df99c93150' 2025-12-04T09:22:01.0209375Z Submodule path 'third_party/aiter/3rdparty/composable_kernel': checked out 'cffe8fa2a442ac8e80dd236a1a5d24fe3d7e0cbf' 2025-12-04T09:22:01.0491280Z Submodule path 'third_party/benchmark': checked out '299e5928955cc62af9968370293b916f5130916f' 2025-12-04T09:22:01.0931919Z Submodule path 'third_party/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T09:22:01.1195093Z Submodule path 'third_party/cpp-httplib': checked out '89c932f313c6437c38f2982869beacc89c2f2246' 2025-12-04T09:22:01.1452494Z Submodule path 'third_party/cpuinfo': checked out 'f858c30bcb16f8effd5ff46996f0514539e17abc' 2025-12-04T09:22:01.1740238Z Submodule path 'third_party/cudnn_frontend': checked out '0b1577c8c83401237d601d0d0db5210506705396' 2025-12-04T09:22:01.5991605Z From https://github.com/NVIDIA/cutlass 2025-12-04T09:22:01.5992224Z ec8daf64..52ae719e main -> origin/main 2025-12-04T09:22:01.5994518Z 5fd9685d..f88806b1 release/4.3 -> origin/release/4.3 2025-12-04T09:22:01.5995699Z * [new tag] v4.3.1 -> v4.3.1 2025-12-04T09:22:01.6281118Z Submodule path 'third_party/cutlass': checked out 'f88806b1e31dfa579842638740216dd41fc6c588' 2025-12-04T09:22:01.6651742Z Submodule path 'third_party/fbgemm': checked out 'c0b988d39a9e47c794d699f29930ed4d7c7e13a4' 2025-12-04T09:22:01.6873729Z Submodule path 'third_party/fbgemm/external/asmjit': checked out 'a3199e8857792cd10b7589ff5d58343d2c9008ea' 2025-12-04T09:22:01.7287559Z Submodule path 'third_party/fbgemm/external/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T09:22:01.7540393Z Submodule path 'third_party/fbgemm/external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-12-04T09:22:01.7859969Z Submodule path 'third_party/fbgemm/external/cutlass': checked out '98125ce499b0fdf7ffbe0e3052f5b8709f4840f8' 2025-12-04T09:22:01.8117875Z Submodule path 'third_party/fbgemm/external/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T09:22:01.8355404Z Submodule path 'third_party/fbgemm/external/hipify_torch': checked out '63b6a7b541fa7f08f8475ca7d74054db36ff2691' 2025-12-04T09:22:01.8654595Z Submodule path 'third_party/fbgemm/external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-12-04T09:22:01.8958034Z Submodule path 'third_party/flash-attention': checked out '979702c87a8713a8e0a5e9fee122b90d2ef13be5' 2025-12-04T09:22:01.9370606Z Submodule path 'third_party/flash-attention/csrc/composable_kernel': checked out '888317e698e9803c62bd38568abc9e05d7709f33' 2025-12-04T09:22:01.9673101Z Submodule path 'third_party/flash-attention/csrc/cutlass': checked out 'c506e16788cb08416a4a57e11a9067beeee29420' 2025-12-04T09:22:02.0017746Z Submodule path 'third_party/flatbuffers': checked out 'a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757' 2025-12-04T09:22:02.0290917Z Submodule path 'third_party/fmt': checked out '407c905e45ad75fc29bf0f9bb7c5c2fd3475976f' 2025-12-04T09:22:02.0555872Z Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2025-12-04T09:22:02.0824826Z Submodule path 'third_party/gloo': checked out '54cbae0d3a67fa890b4c3d9ee162b7860315e341' 2025-12-04T09:22:02.1066127Z Submodule path 'third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T09:22:02.1310992Z Submodule path 'third_party/ideep': checked out '719d8e6cd7f7a0e01b155657526d693acf97c2b3' 2025-12-04T09:22:02.1710105Z Submodule path 'third_party/ideep/mkl-dnn': checked out '8d263e693366ef8db40acc569cc7d8edf644556d' 2025-12-04T09:22:02.1997837Z Submodule path 'third_party/ittapi': checked out 'dec1d23ca65ab069d225dfe40dea14f455170959' 2025-12-04T09:22:02.2273010Z Submodule path 'third_party/kineto': checked out '31f85df8fbd89c188f14ef10f1ec65379786b943' 2025-12-04T09:22:02.2548041Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog': checked out 'd2ffe0a4e3acace628db49974246b66fc3e85fb1' 2025-12-04T09:22:02.2812698Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2025-12-04T09:22:02.3066812Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2025-12-04T09:22:02.3317082Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2025-12-04T09:22:02.3574012Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2025-12-04T09:22:02.3812278Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2025-12-04T09:22:02.4064626Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2025-12-04T09:22:02.4312268Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T09:22:02.4579236Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 2025-12-04T09:22:02.4796002Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2025-12-04T09:22:02.5045909Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp': checked out 'b1234816facfdda29845c46696a02998a4af115a' 2025-12-04T09:22:02.5322824Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'd7ba35bbb649209c66e582d5a0244ba988a15159' 2025-12-04T09:22:02.5561353Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T09:22:02.5817722Z Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out '40626af88bd7df9a5fb80be7b25ac85b122d6c21' 2025-12-04T09:22:02.6057823Z Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T09:22:02.6360924Z Submodule path 'third_party/kleidiai': checked out 'd7770c89632329a9914ef1a90289917597639cbe' 2025-12-04T09:22:02.6634923Z Submodule path 'third_party/mimalloc': checked out 'fbd8b99c2b828428947d70fdc046bb55609be93e' 2025-12-04T09:22:02.6938807Z Submodule path 'third_party/nlohmann': checked out '55f93686c01528224f448c19128836e7df245f72' 2025-12-04T09:22:02.7300609Z Submodule path 'third_party/onnx': checked out 'e709452ef2bbc1d113faf678c24e6d3467696e83' 2025-12-04T09:22:02.7576558Z Submodule path 'third_party/onnx/third_party/pybind11': checked out 'a2e59f0e7065404b44dfe92a28aca47ba1378dc4' 2025-12-04T09:22:02.7903593Z Submodule path 'third_party/opentelemetry-cpp': checked out 'a799f4aed9c94b765dcdaabaeab7d5e7e2310878' 2025-12-04T09:22:02.8143169Z Submodule path 'third_party/opentelemetry-cpp/third_party/benchmark': checked out 'd572f4777349d43653b21d6c2fc63020ab326db2' 2025-12-04T09:22:02.8373967Z Submodule path 'third_party/opentelemetry-cpp/third_party/googletest': checked out 'b796f7d44681514f58a683a3a71ff17c94edb0c1' 2025-12-04T09:22:02.8596160Z Submodule path 'third_party/opentelemetry-cpp/third_party/ms-gsl': checked out '6f4529395c5b7c2d661812257cd6780c67e54afa' 2025-12-04T09:22:02.8887222Z Submodule path 'third_party/opentelemetry-cpp/third_party/nlohmann-json': checked out 'bc889afb4c5bf1c0d8ee29ef35eaaf4c8bef8a5d' 2025-12-04T09:22:02.9142845Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto': checked out '4ca4f0335c63cda7ab31ea7ed70d6553aee14dce' 2025-12-04T09:22:02.9364118Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp': checked out '06b57f48ded1fa3bdd3d4346f6ef29e40e08eaf5' 2025-12-04T09:22:02.9603675Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp': checked out 'c9ffcdda9086ffd9e1283ea7a0276d831f3c8a8d' 2025-12-04T09:22:02.9862907Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'eefb26f82b233268fc98577d265352720d477ba4' 2025-12-04T09:22:03.0118005Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T09:22:03.0495537Z Submodule path 'third_party/opentelemetry-cpp/tools/vcpkg': checked out '8eb57355a4ffb410a2e94c07b4dca2dffbee8e50' 2025-12-04T09:22:03.0781647Z Submodule path 'third_party/pocketfft': checked out '0fa0ef591e38c2758e3184c6c23e497b9f732ffa' 2025-12-04T09:22:03.1158765Z Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2025-12-04T09:22:03.1406223Z Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2025-12-04T09:22:03.1643271Z Submodule path 'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2025-12-04T09:22:03.1891663Z Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2025-12-04T09:22:03.2140846Z Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2025-12-04T09:22:03.2399816Z Submodule path 'third_party/pybind11': checked out 'f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8' 2025-12-04T09:22:03.2652056Z Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 2025-12-04T09:22:03.2899182Z Submodule path 'third_party/sleef': checked out '5a1d179df9cf652951b59010a2d2075372d67f68' 2025-12-04T09:22:03.3130776Z Submodule path 'third_party/tensorpipe': checked out '2b4cd91092d335a697416b2a3cb398283246849d' 2025-12-04T09:22:03.3356793Z Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2025-12-04T09:22:03.3595646Z Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2025-12-04T09:22:03.3940490Z Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '5152db2cbfeb5582e9c27c5ea1dba2cd9e10759b' 2025-12-04T09:22:03.4201390Z Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2025-12-04T09:22:03.4412289Z Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2025-12-04T09:22:03.4504317Z [command]/usr/bin/git submodule foreach --recursive git config --local gc.auto 0 2025-12-04T09:22:03.4781398Z Entering 'android/libs/fbjni' 2025-12-04T09:22:03.4823569Z Entering 'third_party/FP16' 2025-12-04T09:22:03.4861280Z Entering 'third_party/FXdiv' 2025-12-04T09:22:03.4900021Z Entering 'third_party/NNPACK' 2025-12-04T09:22:03.4941405Z Entering 'third_party/NVTX' 2025-12-04T09:22:03.4978465Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:22:03.5028031Z Entering 'third_party/XNNPACK' 2025-12-04T09:22:03.5069960Z Entering 'third_party/aiter' 2025-12-04T09:22:03.5110336Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:22:03.5160511Z Entering 'third_party/benchmark' 2025-12-04T09:22:03.5198062Z Entering 'third_party/composable_kernel' 2025-12-04T09:22:03.5238935Z Entering 'third_party/cpp-httplib' 2025-12-04T09:22:03.5279844Z Entering 'third_party/cpuinfo' 2025-12-04T09:22:03.5320955Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:22:03.5359372Z Entering 'third_party/cutlass' 2025-12-04T09:22:03.5410955Z Entering 'third_party/fbgemm' 2025-12-04T09:22:03.5451883Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:22:03.5507304Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:22:03.5557549Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:22:03.5592368Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:22:03.5649283Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:22:03.5695159Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:22:03.5729646Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:22:03.5774188Z Entering 'third_party/flash-attention' 2025-12-04T09:22:03.5813624Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:22:03.5858726Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:22:03.5902490Z Entering 'third_party/flatbuffers' 2025-12-04T09:22:03.5945721Z Entering 'third_party/fmt' 2025-12-04T09:22:03.5992421Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:22:03.6033284Z Entering 'third_party/gloo' 2025-12-04T09:22:03.6073482Z Entering 'third_party/googletest' 2025-12-04T09:22:03.6113860Z Entering 'third_party/ideep' 2025-12-04T09:22:03.6161904Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:22:03.6209028Z Entering 'third_party/ittapi' 2025-12-04T09:22:03.6243248Z Entering 'third_party/kineto' 2025-12-04T09:22:03.6290418Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:22:03.6331355Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:22:03.6372671Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:22:03.6409215Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:22:03.6452366Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:22:03.6491596Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:22:03.6535133Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:22:03.6581579Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:22:03.6633090Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:22:03.6674624Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:22:03.6713758Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:22:03.6752799Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:03.6793836Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:03.6839618Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:22:03.6872916Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:22:03.6917127Z Entering 'third_party/kleidiai' 2025-12-04T09:22:03.6962854Z Entering 'third_party/mimalloc' 2025-12-04T09:22:03.7002795Z Entering 'third_party/nlohmann' 2025-12-04T09:22:03.7052312Z Entering 'third_party/onnx' 2025-12-04T09:22:03.7107387Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:22:03.7166454Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:22:03.7215212Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:22:03.7264608Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:22:03.7305436Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:22:03.7344724Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:22:03.7393933Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:22:03.7442899Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:22:03.7492228Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:22:03.7532926Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:03.7576146Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:03.7626491Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:22:03.7693668Z Entering 'third_party/pocketfft' 2025-12-04T09:22:03.7742450Z Entering 'third_party/protobuf' 2025-12-04T09:22:03.7795472Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:22:03.7843878Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:22:03.7897460Z Entering 'third_party/psimd' 2025-12-04T09:22:03.7944293Z Entering 'third_party/pthreadpool' 2025-12-04T09:22:03.7994570Z Entering 'third_party/pybind11' 2025-12-04T09:22:03.8029893Z Entering 'third_party/python-peachpy' 2025-12-04T09:22:03.8063030Z Entering 'third_party/sleef' 2025-12-04T09:22:03.8115696Z Entering 'third_party/tensorpipe' 2025-12-04T09:22:03.8163484Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:22:03.8208079Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:22:03.8244093Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:22:03.8294811Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:22:03.8339607Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:22:03.8410382Z ##[endgroup] 2025-12-04T09:22:03.8410731Z ##[group]Persisting credentials for submodules 2025-12-04T09:22:03.8416153Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-12-04T09:22:03.8689837Z Entering 'android/libs/fbjni' 2025-12-04T09:22:03.8722649Z url.https://github.com/.insteadof 2025-12-04T09:22:03.8722961Z url.https://github.com/.insteadof 2025-12-04T09:22:03.8766703Z Entering 'third_party/FP16' 2025-12-04T09:22:03.8796131Z url.https://github.com/.insteadof 2025-12-04T09:22:03.8796349Z url.https://github.com/.insteadof 2025-12-04T09:22:03.8833780Z Entering 'third_party/FXdiv' 2025-12-04T09:22:03.8855885Z url.https://github.com/.insteadof 2025-12-04T09:22:03.8856125Z url.https://github.com/.insteadof 2025-12-04T09:22:03.8891984Z Entering 'third_party/NNPACK' 2025-12-04T09:22:03.8918835Z url.https://github.com/.insteadof 2025-12-04T09:22:03.8919146Z url.https://github.com/.insteadof 2025-12-04T09:22:03.8957291Z Entering 'third_party/NVTX' 2025-12-04T09:22:03.8983398Z url.https://github.com/.insteadof 2025-12-04T09:22:03.8983613Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9018620Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:22:03.9042819Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9043040Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9077199Z Entering 'third_party/XNNPACK' 2025-12-04T09:22:03.9114440Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9114805Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9158427Z Entering 'third_party/aiter' 2025-12-04T09:22:03.9193953Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9194578Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9225658Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:22:03.9245976Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9246354Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9294149Z Entering 'third_party/benchmark' 2025-12-04T09:22:03.9321417Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9322023Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9371133Z Entering 'third_party/composable_kernel' 2025-12-04T09:22:03.9402683Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9404808Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9446698Z Entering 'third_party/cpp-httplib' 2025-12-04T09:22:03.9474683Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9474964Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9507033Z Entering 'third_party/cpuinfo' 2025-12-04T09:22:03.9526966Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9527450Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9570287Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:22:03.9599798Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9600305Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9634042Z Entering 'third_party/cutlass' 2025-12-04T09:22:03.9654687Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9655193Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9697808Z Entering 'third_party/fbgemm' 2025-12-04T09:22:03.9729397Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9729906Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9769674Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:22:03.9794383Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9795446Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9831505Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:22:03.9849344Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9849582Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9893439Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:22:03.9916323Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9916608Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9953098Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:22:03.9972752Z url.https://github.com/.insteadof 2025-12-04T09:22:03.9972986Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0016241Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:22:04.0046138Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0046760Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0079047Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:22:04.0108656Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0108949Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0139095Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:22:04.0166437Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0166984Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0206600Z Entering 'third_party/flash-attention' 2025-12-04T09:22:04.0233511Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0233796Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0275544Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:22:04.0294675Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0295185Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0331598Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:22:04.0360713Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0361219Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0409771Z Entering 'third_party/flatbuffers' 2025-12-04T09:22:04.0438464Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0438966Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0480690Z Entering 'third_party/fmt' 2025-12-04T09:22:04.0510987Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0511504Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0548179Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:22:04.0570515Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0571034Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0609255Z Entering 'third_party/gloo' 2025-12-04T09:22:04.0632322Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0632545Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0671073Z Entering 'third_party/googletest' 2025-12-04T09:22:04.0704564Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0706076Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0736947Z Entering 'third_party/ideep' 2025-12-04T09:22:04.0755887Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0757524Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0790941Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:22:04.0824374Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0826916Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0871755Z Entering 'third_party/ittapi' 2025-12-04T09:22:04.0905727Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0906007Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0941923Z Entering 'third_party/kineto' 2025-12-04T09:22:04.0963297Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0963574Z url.https://github.com/.insteadof 2025-12-04T09:22:04.0997043Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:22:04.1028089Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1028593Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1060899Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:22:04.1089178Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1089691Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1122028Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:22:04.1146523Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1147046Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1180446Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:22:04.1206152Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1207187Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1251091Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:22:04.1270872Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1271111Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1303680Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:22:04.1329854Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1330354Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1369972Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:22:04.1391531Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1392025Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1430062Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:22:04.1451529Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1452025Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1490108Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:22:04.1518689Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1519710Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1553344Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:22:04.1580531Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1581025Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1613939Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:22:04.1645724Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1646314Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1681440Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:04.1710558Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1711682Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1749717Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:04.1772780Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1773276Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1820168Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:22:04.1838850Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1839134Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1873237Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:22:04.1897820Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1898116Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1930862Z Entering 'third_party/kleidiai' 2025-12-04T09:22:04.1955727Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1957553Z url.https://github.com/.insteadof 2025-12-04T09:22:04.1992963Z Entering 'third_party/mimalloc' 2025-12-04T09:22:04.2014467Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2014975Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2051019Z Entering 'third_party/nlohmann' 2025-12-04T09:22:04.2077600Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2078211Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2112630Z Entering 'third_party/onnx' 2025-12-04T09:22:04.2141106Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2141383Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2186373Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:22:04.2209027Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2209300Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2242574Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:22:04.2260040Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2260283Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2292200Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:22:04.2320103Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2320553Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2353520Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:22:04.2376362Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2376728Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2411502Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:22:04.2437813Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2438149Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2465192Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:22:04.2482907Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2483085Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2523406Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:22:04.2550726Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2551060Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2583165Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:22:04.2610506Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2610739Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2646914Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:22:04.2671119Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2671342Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2697218Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:04.2722515Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2722728Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2757577Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:04.2781007Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2781509Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2815927Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:22:04.2842915Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2843204Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2892292Z Entering 'third_party/pocketfft' 2025-12-04T09:22:04.2921845Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2922362Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2954190Z Entering 'third_party/protobuf' 2025-12-04T09:22:04.2983978Z url.https://github.com/.insteadof 2025-12-04T09:22:04.2984485Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3024144Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:22:04.3049321Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3049822Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3079002Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:22:04.3102693Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3103204Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3141256Z Entering 'third_party/psimd' 2025-12-04T09:22:04.3170144Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3170651Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3204099Z Entering 'third_party/pthreadpool' 2025-12-04T09:22:04.3233691Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3234220Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3272997Z Entering 'third_party/pybind11' 2025-12-04T09:22:04.3302250Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3302756Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3335589Z Entering 'third_party/python-peachpy' 2025-12-04T09:22:04.3363871Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3364388Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3405545Z Entering 'third_party/sleef' 2025-12-04T09:22:04.3431832Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3432342Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3466579Z Entering 'third_party/tensorpipe' 2025-12-04T09:22:04.3498230Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3498516Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3531378Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:22:04.3551535Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3551848Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3590630Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:22:04.3609148Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3609437Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3646724Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:22:04.3676730Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3676952Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3713689Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:22:04.3741030Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3741472Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3779578Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:22:04.3808676Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3808890Z url.https://github.com/.insteadof 2025-12-04T09:22:04.3884393Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-12-04T09:22:04.4125315Z Entering 'android/libs/fbjni' 2025-12-04T09:22:04.4172185Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T09:22:04.4192587Z Entering 'third_party/FP16' 2025-12-04T09:22:04.4233179Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T09:22:04.4256712Z Entering 'third_party/FXdiv' 2025-12-04T09:22:04.4296769Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T09:22:04.4317946Z Entering 'third_party/NNPACK' 2025-12-04T09:22:04.4359941Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T09:22:04.4379831Z Entering 'third_party/NVTX' 2025-12-04T09:22:04.4416297Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T09:22:04.4439647Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:22:04.4486747Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T09:22:04.4509016Z Entering 'third_party/XNNPACK' 2025-12-04T09:22:04.4551816Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T09:22:04.4584377Z Entering 'third_party/aiter' 2025-12-04T09:22:04.4627468Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T09:22:04.4648983Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:22:04.4685822Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T09:22:04.4710218Z Entering 'third_party/benchmark' 2025-12-04T09:22:04.4752588Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T09:22:04.4775391Z Entering 'third_party/composable_kernel' 2025-12-04T09:22:04.4824463Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T09:22:04.4850453Z Entering 'third_party/cpp-httplib' 2025-12-04T09:22:04.4898279Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T09:22:04.4919156Z Entering 'third_party/cpuinfo' 2025-12-04T09:22:04.4966109Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T09:22:04.4989989Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:22:04.5027494Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T09:22:04.5050425Z Entering 'third_party/cutlass' 2025-12-04T09:22:04.5096183Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T09:22:04.5126508Z Entering 'third_party/fbgemm' 2025-12-04T09:22:04.5170059Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T09:22:04.5190611Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:22:04.5228123Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T09:22:04.5249891Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:22:04.5286426Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T09:22:04.5309542Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:22:04.5347058Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T09:22:04.5371051Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:22:04.5411683Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T09:22:04.5436542Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:22:04.5478020Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T09:22:04.5497274Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:22:04.5538026Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T09:22:04.5557967Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:22:04.5594421Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T09:22:04.5623681Z Entering 'third_party/flash-attention' 2025-12-04T09:22:04.5667736Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T09:22:04.5689334Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:22:04.5736320Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T09:22:04.5758441Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:22:04.5796462Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T09:22:04.5822361Z Entering 'third_party/flatbuffers' 2025-12-04T09:22:04.5866999Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T09:22:04.5891875Z Entering 'third_party/fmt' 2025-12-04T09:22:04.5929737Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T09:22:04.5946838Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:22:04.5987115Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T09:22:04.6004467Z Entering 'third_party/gloo' 2025-12-04T09:22:04.6044460Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T09:22:04.6069801Z Entering 'third_party/googletest' 2025-12-04T09:22:04.6108154Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:22:04.6130213Z Entering 'third_party/ideep' 2025-12-04T09:22:04.6178473Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T09:22:04.6194661Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:22:04.6230139Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T09:22:04.6260960Z Entering 'third_party/ittapi' 2025-12-04T09:22:04.6300549Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T09:22:04.6323112Z Entering 'third_party/kineto' 2025-12-04T09:22:04.6359476Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T09:22:04.6380313Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:22:04.6421038Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T09:22:04.6445211Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:22:04.6491191Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T09:22:04.6513348Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:22:04.6547595Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T09:22:04.6570002Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:22:04.6610274Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T09:22:04.6634335Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:22:04.6684083Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T09:22:04.6696428Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:22:04.6740437Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T09:22:04.6760435Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:22:04.6791084Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T09:22:04.6810283Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:22:04.6850822Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:22:04.6867445Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:22:04.6899889Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T09:22:04.6916764Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:22:04.6961571Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T09:22:04.6978245Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:22:04.7010831Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T09:22:04.7025118Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:04.7062647Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T09:22:04.7083059Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:04.7118683Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T09:22:04.7141576Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:22:04.7180025Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T09:22:04.7200584Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:22:04.7236192Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T09:22:04.7258149Z Entering 'third_party/kleidiai' 2025-12-04T09:22:04.7290485Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T09:22:04.7309681Z Entering 'third_party/mimalloc' 2025-12-04T09:22:04.7341182Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T09:22:04.7362474Z Entering 'third_party/nlohmann' 2025-12-04T09:22:04.7400819Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T09:22:04.7429222Z Entering 'third_party/onnx' 2025-12-04T09:22:04.7472999Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T09:22:04.7504395Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:22:04.7551920Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T09:22:04.7578503Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:22:04.7621458Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T09:22:04.7644473Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:22:04.7680992Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T09:22:04.7697635Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:22:04.7730903Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:22:04.7753098Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:22:04.7788571Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T09:22:04.7811784Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:22:04.7852522Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T09:22:04.7874383Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:22:04.7912705Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T09:22:04.7928159Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:22:04.7962317Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T09:22:04.7984196Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:22:04.8020905Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T09:22:04.8042897Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:04.8091197Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T09:22:04.8110693Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:04.8144619Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T09:22:04.8167542Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:22:04.8200923Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T09:22:04.8238055Z Entering 'third_party/pocketfft' 2025-12-04T09:22:04.8281939Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T09:22:04.8304780Z Entering 'third_party/protobuf' 2025-12-04T09:22:04.8343139Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T09:22:04.8371212Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:22:04.8412331Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T09:22:04.8428470Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:22:04.8473539Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:22:04.8491230Z Entering 'third_party/psimd' 2025-12-04T09:22:04.8533021Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T09:22:04.8551588Z Entering 'third_party/pthreadpool' 2025-12-04T09:22:04.8590429Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T09:22:04.8619228Z Entering 'third_party/pybind11' 2025-12-04T09:22:04.8659647Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T09:22:04.8680827Z Entering 'third_party/python-peachpy' 2025-12-04T09:22:04.8715411Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T09:22:04.8736481Z Entering 'third_party/sleef' 2025-12-04T09:22:04.8783050Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T09:22:04.8805938Z Entering 'third_party/tensorpipe' 2025-12-04T09:22:04.8843089Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T09:22:04.8865739Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:22:04.8905951Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:22:04.8928377Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:22:04.8967052Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T09:22:04.8985976Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:22:04.9033075Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T09:22:04.9050740Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:22:04.9091531Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T09:22:04.9111703Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:22:04.9153871Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T09:22:04.9472333Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-12-04T09:22:04.9743362Z Entering 'android/libs/fbjni' 2025-12-04T09:22:04.9790010Z Entering 'third_party/FP16' 2025-12-04T09:22:04.9837391Z Entering 'third_party/FXdiv' 2025-12-04T09:22:04.9876101Z Entering 'third_party/NNPACK' 2025-12-04T09:22:04.9917931Z Entering 'third_party/NVTX' 2025-12-04T09:22:04.9952389Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:22:04.9988526Z Entering 'third_party/XNNPACK' 2025-12-04T09:22:05.0031355Z Entering 'third_party/aiter' 2025-12-04T09:22:05.0069815Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:22:05.0124922Z Entering 'third_party/benchmark' 2025-12-04T09:22:05.0159535Z Entering 'third_party/composable_kernel' 2025-12-04T09:22:05.0211271Z Entering 'third_party/cpp-httplib' 2025-12-04T09:22:05.0250270Z Entering 'third_party/cpuinfo' 2025-12-04T09:22:05.0290355Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:22:05.0324046Z Entering 'third_party/cutlass' 2025-12-04T09:22:05.0370897Z Entering 'third_party/fbgemm' 2025-12-04T09:22:05.0411091Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:22:05.0450863Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:22:05.0493474Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:22:05.0538943Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:22:05.0587076Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:22:05.0616298Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:22:05.0650467Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:22:05.0691199Z Entering 'third_party/flash-attention' 2025-12-04T09:22:05.0737524Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:22:05.0786741Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:22:05.0839042Z Entering 'third_party/flatbuffers' 2025-12-04T09:22:05.0891747Z Entering 'third_party/fmt' 2025-12-04T09:22:05.0925966Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:22:05.0969947Z Entering 'third_party/gloo' 2025-12-04T09:22:05.1009787Z Entering 'third_party/googletest' 2025-12-04T09:22:05.1049091Z Entering 'third_party/ideep' 2025-12-04T09:22:05.1086918Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:22:05.1129685Z Entering 'third_party/ittapi' 2025-12-04T09:22:05.1169776Z Entering 'third_party/kineto' 2025-12-04T09:22:05.1213825Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:22:05.1272516Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:22:05.1318196Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:22:05.1351927Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:22:05.1402013Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:22:05.1450475Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:22:05.1493068Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:22:05.1547816Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:22:05.1578882Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:22:05.1614096Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:22:05.1646347Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:22:05.1684133Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:05.1733529Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:05.1787701Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:22:05.1822569Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:22:05.1875581Z Entering 'third_party/kleidiai' 2025-12-04T09:22:05.1915490Z Entering 'third_party/mimalloc' 2025-12-04T09:22:05.1954974Z Entering 'third_party/nlohmann' 2025-12-04T09:22:05.1990397Z Entering 'third_party/onnx' 2025-12-04T09:22:05.2043087Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:22:05.2091101Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:22:05.2132620Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:22:05.2170695Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:22:05.2200282Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:22:05.2239475Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:22:05.2273238Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:22:05.2311371Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:22:05.2349889Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:22:05.2379689Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:05.2423442Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:05.2464198Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:22:05.2517182Z Entering 'third_party/pocketfft' 2025-12-04T09:22:05.2553203Z Entering 'third_party/protobuf' 2025-12-04T09:22:05.2594983Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:22:05.2631984Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:22:05.2674494Z Entering 'third_party/psimd' 2025-12-04T09:22:05.2710707Z Entering 'third_party/pthreadpool' 2025-12-04T09:22:05.2753808Z Entering 'third_party/pybind11' 2025-12-04T09:22:05.2794968Z Entering 'third_party/python-peachpy' 2025-12-04T09:22:05.2840744Z Entering 'third_party/sleef' 2025-12-04T09:22:05.2884058Z Entering 'third_party/tensorpipe' 2025-12-04T09:22:05.2929374Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:22:05.2972608Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:22:05.3014639Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:22:05.3053057Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:22:05.3115248Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:22:05.3198834Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-12-04T09:22:05.3458336Z Entering 'android/libs/fbjni' 2025-12-04T09:22:05.3505815Z Entering 'third_party/FP16' 2025-12-04T09:22:05.3548732Z Entering 'third_party/FXdiv' 2025-12-04T09:22:05.3614925Z Entering 'third_party/NNPACK' 2025-12-04T09:22:05.3627944Z Entering 'third_party/NVTX' 2025-12-04T09:22:05.3665754Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:22:05.3707001Z Entering 'third_party/XNNPACK' 2025-12-04T09:22:05.3759705Z Entering 'third_party/aiter' 2025-12-04T09:22:05.3795753Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:22:05.3843800Z Entering 'third_party/benchmark' 2025-12-04T09:22:05.3885824Z Entering 'third_party/composable_kernel' 2025-12-04T09:22:05.3941677Z Entering 'third_party/cpp-httplib' 2025-12-04T09:22:05.3984984Z Entering 'third_party/cpuinfo' 2025-12-04T09:22:05.4035940Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:22:05.4073948Z Entering 'third_party/cutlass' 2025-12-04T09:22:05.4131475Z Entering 'third_party/fbgemm' 2025-12-04T09:22:05.4169556Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:22:05.4213445Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:22:05.4262561Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:22:05.4294097Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:22:05.4332211Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:22:05.4362985Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:22:05.4394850Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:22:05.4432825Z Entering 'third_party/flash-attention' 2025-12-04T09:22:05.4467789Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:22:05.4510259Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:22:05.4551122Z Entering 'third_party/flatbuffers' 2025-12-04T09:22:05.4588776Z Entering 'third_party/fmt' 2025-12-04T09:22:05.4627087Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:22:05.4665131Z Entering 'third_party/gloo' 2025-12-04T09:22:05.4705508Z Entering 'third_party/googletest' 2025-12-04T09:22:05.4747196Z Entering 'third_party/ideep' 2025-12-04T09:22:05.4785258Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:22:05.4832786Z Entering 'third_party/ittapi' 2025-12-04T09:22:05.4867272Z Entering 'third_party/kineto' 2025-12-04T09:22:05.4914685Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:22:05.4955246Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:22:05.4987560Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:22:05.5025495Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:22:05.5058800Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:22:05.5098599Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:22:05.5141516Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:22:05.5179068Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:22:05.5219612Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:22:05.5259005Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:22:05.5299016Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:22:05.5336546Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:05.5370378Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:05.5423978Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:22:05.5459974Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:22:05.5502662Z Entering 'third_party/kleidiai' 2025-12-04T09:22:05.5541419Z Entering 'third_party/mimalloc' 2025-12-04T09:22:05.5588352Z Entering 'third_party/nlohmann' 2025-12-04T09:22:05.5628768Z Entering 'third_party/onnx' 2025-12-04T09:22:05.5680072Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:22:05.5724543Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:22:05.5769325Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:22:05.5817324Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:22:05.5857437Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:22:05.5883592Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:22:05.5919997Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:22:05.5960033Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:22:05.5998681Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:22:05.6040266Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:05.6079684Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:05.6121127Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:22:05.6176668Z Entering 'third_party/pocketfft' 2025-12-04T09:22:05.6211452Z Entering 'third_party/protobuf' 2025-12-04T09:22:05.6260148Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:22:05.6299664Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:22:05.6351635Z Entering 'third_party/psimd' 2025-12-04T09:22:05.6398720Z Entering 'third_party/pthreadpool' 2025-12-04T09:22:05.6439092Z Entering 'third_party/pybind11' 2025-12-04T09:22:05.6481518Z Entering 'third_party/python-peachpy' 2025-12-04T09:22:05.6529133Z Entering 'third_party/sleef' 2025-12-04T09:22:05.6569367Z Entering 'third_party/tensorpipe' 2025-12-04T09:22:05.6609446Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:22:05.6649353Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:22:05.6689382Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:22:05.6729987Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:22:05.6764836Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:22:05.6823447Z ##[endgroup] 2025-12-04T09:22:05.6868663Z [command]/usr/bin/git log -1 --format=%H 2025-12-04T09:22:05.6893830Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:22:05.6995646Z ##[group]Run cd "${GITHUB_WORKSPACE}" 2025-12-04T09:22:05.6996029Z cd "${GITHUB_WORKSPACE}" 2025-12-04T09:22:05.6996210Z # Clean stale submodule dirs 2025-12-04T09:22:05.6996384Z if [ -z "${NO_SUDO}" ]; then 2025-12-04T09:22:05.6996593Z  sudo git submodule foreach --recursive git clean -ffdx 2025-12-04T09:22:05.6996808Z else 2025-12-04T09:22:05.6996972Z  git submodule foreach --recursive git clean -ffdx 2025-12-04T09:22:05.6997172Z fi 2025-12-04T09:22:05.7008087Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:05.7008316Z env: 2025-12-04T09:22:05.7008445Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:05.7008593Z NO_SUDO: true 2025-12-04T09:22:05.7008728Z ##[endgroup] 2025-12-04T09:22:05.7277917Z Entering 'android/libs/fbjni' 2025-12-04T09:22:05.7312861Z Entering 'third_party/FP16' 2025-12-04T09:22:05.7349983Z Entering 'third_party/FXdiv' 2025-12-04T09:22:05.7389934Z Entering 'third_party/NNPACK' 2025-12-04T09:22:05.7432311Z Entering 'third_party/NVTX' 2025-12-04T09:22:05.7491369Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:22:05.7522371Z Entering 'third_party/XNNPACK' 2025-12-04T09:22:05.7662340Z Entering 'third_party/aiter' 2025-12-04T09:22:05.7712786Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:22:05.7840999Z Entering 'third_party/benchmark' 2025-12-04T09:22:05.7873772Z Entering 'third_party/composable_kernel' 2025-12-04T09:22:05.8001142Z Entering 'third_party/cpp-httplib' 2025-12-04T09:22:05.8042479Z Entering 'third_party/cpuinfo' 2025-12-04T09:22:05.8084511Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:22:05.8123777Z Entering 'third_party/cutlass' 2025-12-04T09:22:05.8240468Z Entering 'third_party/fbgemm' 2025-12-04T09:22:05.8306472Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:22:05.8343266Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:22:05.8464143Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:22:05.8505309Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:22:05.8621591Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:22:05.8663491Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:22:05.8705517Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:22:05.8758473Z Entering 'third_party/flash-attention' 2025-12-04T09:22:05.8804081Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:22:05.8912503Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:22:05.9013441Z Entering 'third_party/flatbuffers' 2025-12-04T09:22:05.9097366Z Entering 'third_party/fmt' 2025-12-04T09:22:05.9135257Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:22:05.9168553Z Entering 'third_party/gloo' 2025-12-04T09:22:05.9205199Z Entering 'third_party/googletest' 2025-12-04T09:22:05.9244603Z Entering 'third_party/ideep' 2025-12-04T09:22:05.9281409Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:22:05.9381820Z Entering 'third_party/ittapi' 2025-12-04T09:22:05.9416975Z Entering 'third_party/kineto' 2025-12-04T09:22:05.9456258Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:22:05.9495763Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:22:05.9547154Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:22:05.9583251Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:22:05.9618176Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:22:05.9655907Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:22:05.9700484Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:22:05.9736497Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:22:05.9782582Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:22:05.9825053Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:22:05.9848705Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:22:05.9884115Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:05.9939331Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:05.9984764Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:22:06.0018319Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:22:06.0062143Z Entering 'third_party/kleidiai' 2025-12-04T09:22:06.0105099Z Entering 'third_party/mimalloc' 2025-12-04T09:22:06.0145604Z Entering 'third_party/nlohmann' 2025-12-04T09:22:06.0198063Z Entering 'third_party/onnx' 2025-12-04T09:22:06.0566127Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:22:06.0612302Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:22:06.0668208Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:22:06.0705661Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:22:06.0738970Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:22:06.0778607Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:22:06.0821691Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:22:06.0859536Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:22:06.0883599Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:22:06.0917341Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:22:06.0971251Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:22:06.1012068Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:22:06.1315074Z Entering 'third_party/pocketfft' 2025-12-04T09:22:06.1348626Z Entering 'third_party/protobuf' 2025-12-04T09:22:06.1432513Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:22:06.1470248Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:22:06.1511862Z Entering 'third_party/psimd' 2025-12-04T09:22:06.1548578Z Entering 'third_party/pthreadpool' 2025-12-04T09:22:06.1588737Z Entering 'third_party/pybind11' 2025-12-04T09:22:06.1626119Z Entering 'third_party/python-peachpy' 2025-12-04T09:22:06.1660123Z Entering 'third_party/sleef' 2025-12-04T09:22:06.1699424Z Entering 'third_party/tensorpipe' 2025-12-04T09:22:06.1740165Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:22:06.1779575Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:22:06.1819261Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:22:06.1851717Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:22:06.1888668Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:22:06.2020972Z Prepare all required actions 2025-12-04T09:22:06.2021330Z Getting action download info 2025-12-04T09:22:06.4189418Z Download action repository 'aws-actions/amazon-ecr-login@062b18b96a7aff071d4dc91bc00c4c1a7945b076' (SHA:062b18b96a7aff071d4dc91bc00c4c1a7945b076) 2025-12-04T09:22:06.8751318Z ##[group]Run ./.github/actions/setup-rocm 2025-12-04T09:22:06.8751623Z env: 2025-12-04T09:22:06.8751826Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:06.8752066Z ##[endgroup] 2025-12-04T09:22:06.8778426Z ##[group]Run dpkg -l | grep -E " rocm" 2025-12-04T09:22:06.8778761Z dpkg -l | grep -E " rocm" 2025-12-04T09:22:06.8794929Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:06.8795286Z env: 2025-12-04T09:22:06.8795486Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:06.8795720Z ##[endgroup] 2025-12-04T09:22:06.8940770Z ii rocm 7.0.0.70000-38~22.04 amd64 Radeon Open Compute (ROCm) software stack meta package 2025-12-04T09:22:06.8941750Z ii rocm-cmake 0.14.0.70000-38~22.04 amd64 rocm-cmake built using CMake 2025-12-04T09:22:06.8942564Z ii rocm-core 7.0.0.70000-38~22.04 amd64 ROCm Runtime software stack 2025-12-04T09:22:06.8943409Z ii rocm-dbgapi 0.77.3.70000-38~22.04 amd64 Library to provide AMD GPU debugger API 2025-12-04T09:22:06.8944393Z ii rocm-debug-agent 2.1.0.70000-38~22.04 amd64 Radeon Open Compute Debug Agent (ROCdebug-agent) 2025-12-04T09:22:06.8945516Z ii rocm-developer-tools 7.0.0.70000-38~22.04 amd64 Radeon Open Compute (ROCm) Runtime software stack 2025-12-04T09:22:06.8946383Z ii rocm-device-libs 1.0.0.70000-38~22.04 amd64 Radeon Open Compute - device libraries 2025-12-04T09:22:06.8946981Z ii rocm-gdb 16.3.70000-38~22.04 amd64 ROCgdb 2025-12-04T09:22:06.8947314Z ii rocm-hip 7.0.0.70000-38~22.04 amd64 Radeon Open Compute (ROCm) Runtime software stack 2025-12-04T09:22:06.8947706Z ii rocm-hip-runtime 7.0.0.70000-38~22.04 amd64 Radeon Open Compute (ROCm) Runtime software stack 2025-12-04T09:22:06.8948123Z ii rocm-hip-runtime-dev 7.0.0.70000-38~22.04 amd64 Radeon Open Compute (ROCm) Runtime software stack 2025-12-04T09:22:06.8948551Z ii rocm-language-runtime 7.0.0.70000-38~22.04 amd64 Radeon Open Compute (ROCm) Runtime software stack 2025-12-04T09:22:06.8948940Z ii rocm-llvm 20.0.0.25314.70000-38~22.04 amd64 ROCm core compiler 2025-12-04T09:22:06.8949282Z ii rocm-opencl 2.0.0.70000-38~22.04 amd64 clr built using CMake 2025-12-04T09:22:06.8949623Z ii rocm-opencl-dev 2.0.0.70000-38~22.04 amd64 clr built using CMake 2025-12-04T09:22:06.8950016Z ii rocm-opencl-sdk 7.0.0.70000-38~22.04 amd64 Radeon Open Compute (ROCm) Runtime software stack 2025-12-04T09:22:06.8950447Z ii rocm-openmp 7.0.0.70000-38~22.04 amd64 Radeon Open Compute (ROCm) OpenMP Software development Kit. 2025-12-04T09:22:06.8950848Z ii rocm-smi-lib 7.8.0.70000-38~22.04 amd64 AMD System Management libraries 2025-12-04T09:22:06.8951218Z ii rocminfo 1.0.0.70000-38~22.04 amd64 Radeon Open Compute (ROCm) Runtime rocminfo tool 2025-12-04T09:22:06.8966396Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2025-12-04T09:22:06.8966911Z # ignore expansion of "docker ps -q" since it could be empty 2025-12-04T09:22:06.8967151Z # shellcheck disable=SC2046 2025-12-04T09:22:06.8967337Z docker stop $(docker ps -q) || true 2025-12-04T09:22:06.8967536Z # Prune all stopped containers. 2025-12-04T09:22:06.8967725Z docker container prune -f 2025-12-04T09:22:06.8979280Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:06.8979507Z env: 2025-12-04T09:22:06.8979636Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:06.8979794Z ##[endgroup] 2025-12-04T09:22:17.3932175Z ffb857a76e4b 2025-12-04T09:22:23.3922639Z Deleted Containers: 2025-12-04T09:22:23.3923380Z ffb857a76e4b94013d515e86d438cd09f246ae6e62e13e40a014d18a34fe8ee9 2025-12-04T09:22:23.3923873Z 2025-12-04T09:22:23.3924222Z Total reclaimed space: 13.13GB 2025-12-04T09:22:23.3985653Z ##[group]Run cat /etc/os-release || true 2025-12-04T09:22:23.3985981Z cat /etc/os-release || true 2025-12-04T09:22:23.3986270Z cat /etc/apt/sources.list.d/rocm.list || true 2025-12-04T09:22:23.3986553Z cat /opt/rocm/.info/version || true 2025-12-04T09:22:23.3986786Z whoami 2025-12-04T09:22:23.4008068Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:23.4008530Z env: 2025-12-04T09:22:23.4008793Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:23.4009089Z ##[endgroup] 2025-12-04T09:22:23.4060069Z PRETTY_NAME="Ubuntu 22.04 LTS" 2025-12-04T09:22:23.4060940Z NAME="Ubuntu" 2025-12-04T09:22:23.4061393Z VERSION_ID="22.04" 2025-12-04T09:22:23.4061783Z VERSION="22.04 (Jammy Jellyfish)" 2025-12-04T09:22:23.4062200Z VERSION_CODENAME=jammy 2025-12-04T09:22:23.4062533Z ID=ubuntu 2025-12-04T09:22:23.4062808Z ID_LIKE=debian 2025-12-04T09:22:23.4063190Z HOME_URL="https://www.ubuntu.com/" 2025-12-04T09:22:23.4063661Z SUPPORT_URL="https://help.ubuntu.com/" 2025-12-04T09:22:23.4064193Z BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/" 2025-12-04T09:22:23.4065633Z PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy" 2025-12-04T09:22:23.4066325Z UBUNTU_CODENAME=jammy 2025-12-04T09:22:23.4067358Z deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/7.0 jammy main 2025-12-04T09:22:23.4068498Z deb [arch=amd64,i386 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/graphics/7.0/ubuntu jammy main 2025-12-04T09:22:23.4074659Z 7.0.0 2025-12-04T09:22:23.4081945Z pytorchci 2025-12-04T09:22:23.4103161Z ##[group]Run dpkg -l | grep -E " amdgpu" 2025-12-04T09:22:23.4103402Z dpkg -l | grep -E " amdgpu" 2025-12-04T09:22:23.4114701Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:23.4114919Z env: 2025-12-04T09:22:23.4115049Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:23.4115198Z ##[endgroup] 2025-12-04T09:22:23.4254766Z ii amdgpu-core 1:7.0.70000-2204044.22.04 all Core meta package for unified amdgpu driver. 2025-12-04T09:22:23.4255329Z ii amdgpu-dkms 1:6.14.14.30100000-2204008.22.04 all amdgpu driver in DKMS format. 2025-12-04T09:22:23.4255905Z ii amdgpu-dkms-firmware 30.10.0.0.30100000-2204008.22.04 all firmware blobs used by amdgpu driver in DKMS format 2025-12-04T09:22:23.4256483Z ii amdgpu-install 30.10.0.0.30100000-2204008.22.04 all AMDGPU driver repository and installer 2025-12-04T09:22:23.4286059Z ##[group]Run rocm-smi 2025-12-04T09:22:23.4286540Z rocm-smi 2025-12-04T09:22:23.4311603Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:23.4312136Z env: 2025-12-04T09:22:23.4312440Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:23.4312796Z ##[endgroup] 2025-12-04T09:22:23.5138811Z WARNING: AMD GPUs visible, but data is inaccessible. Check power control/runtime_status 2025-12-04T09:22:23.5139155Z 2025-12-04T09:22:23.5184077Z 2025-12-04T09:22:23.5184123Z 2025-12-04T09:22:23.5185509Z ======================================== ROCm System Management Interface ======================================== 2025-12-04T09:22:23.5186322Z ================================================== Concise Info ================================================== 2025-12-04T09:22:23.5187141Z Device Node IDs Temp Power Partitions SCLK MCLK Fan Perf PwrCap VRAM% GPU% 2025-12-04T09:22:23.5188433Z  (DID, GUID) (Edge) (Avg) (Mem, Compute, ID)  2025-12-04T09:22:23.5189128Z ================================================================================================================== 2025-12-04T09:22:23.5189779Z 0 1 0x744c, 43741 N/A N/A N/A, N/A, 0 N/A N/A 0% unknown N/A 0% 0% 2025-12-04T09:22:23.5190400Z ================================================================================================================== 2025-12-04T09:22:23.5190986Z ============================================== End of ROCm SMI Log =============================================== 2025-12-04T09:22:23.5270977Z ##[group]Run rocminfo 2025-12-04T09:22:23.5271337Z rocminfo 2025-12-04T09:22:23.5296743Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:23.5297275Z env: 2025-12-04T09:22:23.5297582Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:23.5297937Z ##[endgroup] 2025-12-04T09:22:24.2910630Z ROCk module version 6.14.14 is loaded 2025-12-04T09:22:24.2911574Z ===================== 2025-12-04T09:22:24.2912101Z HSA System Attributes 2025-12-04T09:22:24.2912482Z ===================== 2025-12-04T09:22:24.2912840Z Runtime Version: 1.18 2025-12-04T09:22:24.2913210Z Runtime Ext Version: 1.11 2025-12-04T09:22:24.2913602Z System Timestamp Freq.: 1000.000000MHz 2025-12-04T09:22:24.2914561Z Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) 2025-12-04T09:22:24.2915834Z Machine Model: LARGE 2025-12-04T09:22:24.2916139Z System Endianness: LITTLE 2025-12-04T09:22:24.2916406Z Mwaitx: DISABLED 2025-12-04T09:22:24.2916606Z XNACK enabled: NO 2025-12-04T09:22:24.2916806Z DMAbuf Support: YES 2025-12-04T09:22:24.2916997Z VMM Support: YES 2025-12-04T09:22:24.2917123Z 2025-12-04T09:22:24.2917185Z ========== 2025-12-04T09:22:24.2917361Z HSA Agents 2025-12-04T09:22:24.2917527Z ========== 2025-12-04T09:22:24.2917693Z ******* 2025-12-04T09:22:24.2917857Z Agent 1 2025-12-04T09:22:24.2918024Z ******* 2025-12-04T09:22:24.2918247Z Name: AMD Ryzen Threadripper PRO 5975WX 32-Cores 2025-12-04T09:22:24.2918552Z Uuid: CPU-XX 2025-12-04T09:22:24.2918866Z Marketing Name: AMD Ryzen Threadripper PRO 5975WX 32-Cores 2025-12-04T09:22:24.2919186Z Vendor Name: CPU 2025-12-04T09:22:24.2919466Z Feature: None specified 2025-12-04T09:22:24.2919743Z Profile: FULL_PROFILE 2025-12-04T09:22:24.2920022Z Float Round Mode: NEAR 2025-12-04T09:22:24.2920313Z Max Queue Number: 0(0x0) 2025-12-04T09:22:24.2920596Z Queue Min Size: 0(0x0) 2025-12-04T09:22:24.2920867Z Queue Max Size: 0(0x0) 2025-12-04T09:22:24.2921142Z Queue Type: MULTI 2025-12-04T09:22:24.2921404Z Node: 0 2025-12-04T09:22:24.2921661Z Device Type: CPU 2025-12-04T09:22:24.2921916Z Cache Info: 2025-12-04T09:22:24.2926500Z L1: 32768(0x8000) KB 2025-12-04T09:22:24.2927030Z Chip ID: 0(0x0) 2025-12-04T09:22:24.2927545Z ASIC Revision: 0(0x0) 2025-12-04T09:22:24.2928283Z Cacheline Size: 64(0x40) 2025-12-04T09:22:24.2928843Z Max Clock Freq. (MHz): 3600 2025-12-04T09:22:24.2929357Z BDFID: 0 2025-12-04T09:22:24.2929858Z Internal Node ID: 0 2025-12-04T09:22:24.2930402Z Compute Unit: 64 2025-12-04T09:22:24.2930925Z SIMDs per CU: 0 2025-12-04T09:22:24.2931446Z Shader Engines: 0 2025-12-04T09:22:24.2932000Z Shader Arrs. per Eng.: 0 2025-12-04T09:22:24.2932576Z WatchPts on Addr. Ranges:1 2025-12-04T09:22:24.2933077Z Memory Properties: 2025-12-04T09:22:24.2933446Z Features: None 2025-12-04T09:22:24.2933801Z Pool Info: 2025-12-04T09:22:24.2934147Z Pool 1 2025-12-04T09:22:24.2934588Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T09:22:24.2935123Z Size: 263739968(0xfb85a40) KB 2025-12-04T09:22:24.2935643Z Allocatable: TRUE 2025-12-04T09:22:24.2936178Z Alloc Granule: 4KB 2025-12-04T09:22:24.2936760Z Alloc Recommended Granule:4KB 2025-12-04T09:22:24.2937352Z Alloc Alignment: 4KB 2025-12-04T09:22:24.2937911Z Accessible by all: TRUE 2025-12-04T09:22:24.2938659Z Pool 2 2025-12-04T09:22:24.2939100Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T09:22:24.2939615Z Size: 263739968(0xfb85a40) KB 2025-12-04T09:22:24.2940128Z Allocatable: TRUE 2025-12-04T09:22:24.2940667Z Alloc Granule: 4KB 2025-12-04T09:22:24.2941240Z Alloc Recommended Granule:4KB 2025-12-04T09:22:24.2941811Z Alloc Alignment: 4KB 2025-12-04T09:22:24.2942362Z Accessible by all: TRUE 2025-12-04T09:22:24.2942843Z Pool 3 2025-12-04T09:22:24.2943269Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2025-12-04T09:22:24.2943780Z Size: 263739968(0xfb85a40) KB 2025-12-04T09:22:24.2944292Z Allocatable: TRUE 2025-12-04T09:22:24.2944845Z Alloc Granule: 4KB 2025-12-04T09:22:24.2945411Z Alloc Recommended Granule:4KB 2025-12-04T09:22:24.2945999Z Alloc Alignment: 4KB 2025-12-04T09:22:24.2946551Z Accessible by all: TRUE 2025-12-04T09:22:24.2947026Z Pool 4 2025-12-04T09:22:24.2947448Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T09:22:24.2947951Z Size: 263739968(0xfb85a40) KB 2025-12-04T09:22:24.2948461Z Allocatable: TRUE 2025-12-04T09:22:24.2949000Z Alloc Granule: 4KB 2025-12-04T09:22:24.2949558Z Alloc Recommended Granule:4KB 2025-12-04T09:22:24.2950134Z Alloc Alignment: 4KB 2025-12-04T09:22:24.2950946Z Accessible by all: TRUE 2025-12-04T09:22:24.2951434Z ISA Info: 2025-12-04T09:22:24.2951775Z ******* 2025-12-04T09:22:24.2952095Z Agent 2 2025-12-04T09:22:24.2952410Z ******* 2025-12-04T09:22:24.2952772Z Name: gfx1100 2025-12-04T09:22:24.2953275Z Uuid: GPU-0ac6979acc6ebee8 2025-12-04T09:22:24.2953814Z Marketing Name: Radeon RX 7900 XT 2025-12-04T09:22:24.2954350Z Vendor Name: AMD 2025-12-04T09:22:24.2954879Z Feature: KERNEL_DISPATCH 2025-12-04T09:22:24.2955402Z Profile: BASE_PROFILE 2025-12-04T09:22:24.2955932Z Float Round Mode: NEAR 2025-12-04T09:22:24.2956481Z Max Queue Number: 128(0x80) 2025-12-04T09:22:24.2957030Z Queue Min Size: 64(0x40) 2025-12-04T09:22:24.2957565Z Queue Max Size: 131072(0x20000) 2025-12-04T09:22:24.2958086Z Queue Type: MULTI 2025-12-04T09:22:24.2958568Z Node: 1 2025-12-04T09:22:24.2959061Z Device Type: GPU 2025-12-04T09:22:24.2959524Z Cache Info: 2025-12-04T09:22:24.2959902Z L1: 32(0x20) KB 2025-12-04T09:22:24.2960360Z L2: 6144(0x1800) KB 2025-12-04T09:22:24.2960805Z L3: 81920(0x14000) KB 2025-12-04T09:22:24.2961265Z Chip ID: 29772(0x744c) 2025-12-04T09:22:24.2961774Z ASIC Revision: 0(0x0) 2025-12-04T09:22:24.2962522Z Cacheline Size: 128(0x80) 2025-12-04T09:22:24.2963058Z Max Clock Freq. (MHz): 2025 2025-12-04T09:22:24.2963558Z BDFID: 17152 2025-12-04T09:22:24.2964048Z Internal Node ID: 1 2025-12-04T09:22:24.2964572Z Compute Unit: 84 2025-12-04T09:22:24.2965085Z SIMDs per CU: 2 2025-12-04T09:22:24.2965601Z Shader Engines: 6 2025-12-04T09:22:24.2966148Z Shader Arrs. per Eng.: 2 2025-12-04T09:22:24.2966801Z WatchPts on Addr. Ranges:4 2025-12-04T09:22:24.2967368Z Coherent Host Access: FALSE 2025-12-04T09:22:24.2967865Z Memory Properties: 2025-12-04T09:22:24.2968281Z Features: KERNEL_DISPATCH 2025-12-04T09:22:24.2968783Z Fast F16 Operation: TRUE 2025-12-04T09:22:24.2969338Z Wavefront Size: 32(0x20) 2025-12-04T09:22:24.2969893Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:22:24.2970401Z Workgroup Max Size per Dimension: 2025-12-04T09:22:24.2970820Z x 1024(0x400) 2025-12-04T09:22:24.2971250Z y 1024(0x400) 2025-12-04T09:22:24.2971678Z z 1024(0x400) 2025-12-04T09:22:24.2972159Z Max Waves Per CU: 32(0x20) 2025-12-04T09:22:24.2972692Z Max Work-item Per CU: 1024(0x400) 2025-12-04T09:22:24.2973232Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:22:24.2973714Z Grid Max Size per Dimension: 2025-12-04T09:22:24.2974109Z x 2147483647(0x7fffffff) 2025-12-04T09:22:24.2974806Z y 65535(0xffff) 2025-12-04T09:22:24.2975253Z z 65535(0xffff) 2025-12-04T09:22:24.2975761Z Max fbarriers/Workgrp: 32 2025-12-04T09:22:24.2983118Z Packet Processor uCode:: 552 2025-12-04T09:22:24.2983793Z SDMA engine uCode:: 24 2025-12-04T09:22:24.2984390Z IOMMU Support:: None 2025-12-04T09:22:24.2984881Z Pool Info: 2025-12-04T09:22:24.2985242Z Pool 1 2025-12-04T09:22:24.2985699Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T09:22:24.2986243Z Size: 20955136(0x13fc000) KB 2025-12-04T09:22:24.2986768Z Allocatable: TRUE 2025-12-04T09:22:24.2987354Z Alloc Granule: 4KB 2025-12-04T09:22:24.2987939Z Alloc Recommended Granule:2048KB 2025-12-04T09:22:24.2988526Z Alloc Alignment: 4KB 2025-12-04T09:22:24.2989095Z Accessible by all: FALSE 2025-12-04T09:22:24.2989579Z Pool 2 2025-12-04T09:22:24.2990034Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T09:22:24.2990587Z Size: 20955136(0x13fc000) KB 2025-12-04T09:22:24.2991157Z Allocatable: TRUE 2025-12-04T09:22:24.2991707Z Alloc Granule: 4KB 2025-12-04T09:22:24.2992280Z Alloc Recommended Granule:2048KB 2025-12-04T09:22:24.2992855Z Alloc Alignment: 4KB 2025-12-04T09:22:24.2993776Z Accessible by all: FALSE 2025-12-04T09:22:24.2994264Z Pool 3 2025-12-04T09:22:24.2994682Z Segment: GROUP 2025-12-04T09:22:24.2995179Z Size: 64(0x40) KB 2025-12-04T09:22:24.2995676Z Allocatable: FALSE 2025-12-04T09:22:24.2996216Z Alloc Granule: 0KB 2025-12-04T09:22:24.2996791Z Alloc Recommended Granule:0KB 2025-12-04T09:22:24.2997357Z Alloc Alignment: 0KB 2025-12-04T09:22:24.2997915Z Accessible by all: FALSE 2025-12-04T09:22:24.2998396Z ISA Info: 2025-12-04T09:22:24.2998729Z ISA 1 2025-12-04T09:22:24.2999142Z Name: amdgcn-amd-amdhsa--gfx1100 2025-12-04T09:22:24.2999703Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T09:22:24.3000269Z Profiles: HSA_PROFILE_BASE 2025-12-04T09:22:24.3000835Z Default Rounding Mode: NEAR 2025-12-04T09:22:24.3001406Z Default Rounding Mode: NEAR 2025-12-04T09:22:24.3001933Z Fast f16: TRUE 2025-12-04T09:22:24.3002467Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:22:24.3002973Z Workgroup Max Size per Dimension: 2025-12-04T09:22:24.3003432Z x 1024(0x400) 2025-12-04T09:22:24.3003881Z y 1024(0x400) 2025-12-04T09:22:24.3004308Z z 1024(0x400) 2025-12-04T09:22:24.3004794Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:22:24.3005525Z Grid Max Size per Dimension: 2025-12-04T09:22:24.3005935Z x 2147483647(0x7fffffff) 2025-12-04T09:22:24.3006494Z y 65535(0xffff) 2025-12-04T09:22:24.3006918Z z 65535(0xffff) 2025-12-04T09:22:24.3007420Z FBarrier Max Size: 32 2025-12-04T09:22:24.3007886Z ISA 2 2025-12-04T09:22:24.3008305Z Name: amdgcn-amd-amdhsa--gfx11-generic 2025-12-04T09:22:24.3008864Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T09:22:24.3009418Z Profiles: HSA_PROFILE_BASE 2025-12-04T09:22:24.3009977Z Default Rounding Mode: NEAR 2025-12-04T09:22:24.3010557Z Default Rounding Mode: NEAR 2025-12-04T09:22:24.3011101Z Fast f16: TRUE 2025-12-04T09:22:24.3011632Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:22:24.3012138Z Workgroup Max Size per Dimension: 2025-12-04T09:22:24.3012563Z x 1024(0x400) 2025-12-04T09:22:24.3013001Z y 1024(0x400) 2025-12-04T09:22:24.3013429Z z 1024(0x400) 2025-12-04T09:22:24.3013904Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:22:24.3014384Z Grid Max Size per Dimension: 2025-12-04T09:22:24.3014773Z x 2147483647(0x7fffffff) 2025-12-04T09:22:24.3015212Z y 65535(0xffff) 2025-12-04T09:22:24.3015643Z z 65535(0xffff) 2025-12-04T09:22:24.3016389Z FBarrier Max Size: 32 2025-12-04T09:22:24.3016867Z *** Done *** 2025-12-04T09:22:24.3084880Z ##[group]Run ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx') 2025-12-04T09:22:24.3085493Z ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx') 2025-12-04T09:22:24.3086646Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2025-12-04T09:22:24.3087569Z if [[ $ngpu -eq 0 ]]; then 2025-12-04T09:22:24.3088067Z  echo "Error: Failed to detect any GPUs on the runner" 2025-12-04T09:22:24.3088558Z  echo "$msg" 2025-12-04T09:22:24.3088873Z  exit 1 2025-12-04T09:22:24.3089167Z fi 2025-12-04T09:22:24.3112892Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:24.3113382Z env: 2025-12-04T09:22:24.3113669Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:24.3114019Z ##[endgroup] 2025-12-04T09:22:24.4119630Z ##[group]Run pytorch/pytorch/.github/actions/diskspace-cleanup@main 2025-12-04T09:22:24.4120167Z with: 2025-12-04T09:22:24.4120447Z diskspace-cutoff: 70 2025-12-04T09:22:24.4120746Z env: 2025-12-04T09:22:24.4121004Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:24.4121342Z ##[endgroup] 2025-12-04T09:22:24.4173863Z ##[group]Run set -ex 2025-12-04T09:22:24.4174228Z set -ex 2025-12-04T09:22:24.4174540Z diskspace_cutoff=70 2025-12-04T09:22:24.4175033Z docker_root_dir=$(docker info -f '{{.DockerRootDir}}') 2025-12-04T09:22:24.4175587Z if [ ! -d "$docker_root_dir" ]; then 2025-12-04T09:22:24.4176300Z  echo "Docker root directory ($docker_root_dir) does not exist. Skipping disk space check." 2025-12-04T09:22:24.4176959Z  exit 0 2025-12-04T09:22:24.4177244Z fi 2025-12-04T09:22:24.4177810Z diskspace=$(df -H --output=pcent ${docker_root_dir} | sed -n 2p | sed 's/%//' | sed 's/ //') 2025-12-04T09:22:24.4179348Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2025-12-04T09:22:24.4180372Z if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2025-12-04T09:22:24.4180867Z  docker system prune -af 2025-12-04T09:22:24.4181535Z  diskspace_new=$(df -H --output=pcent ${docker_root_dir} | sed -n 2p | sed 's/%//' | sed 's/ //') 2025-12-04T09:22:24.4182289Z  if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2025-12-04T09:22:24.4182848Z  diskspace_cutoff_int=$((diskspace_cutoff + 0)) 2025-12-04T09:22:24.4183382Z  difference=$((100 - diskspace_cutoff_int)) 2025-12-04T09:22:24.4184109Z  echo "Error: Available diskspace is less than $difference percent. Not enough diskspace." 2025-12-04T09:22:24.4184770Z  echo "$msg" 2025-12-04T09:22:24.4185106Z  exit 1 2025-12-04T09:22:24.4185412Z  else 2025-12-04T09:22:24.4185779Z  difference=$((diskspace - diskspace_new)) 2025-12-04T09:22:24.4186299Z  echo "Diskspace saved: $difference percent" 2025-12-04T09:22:24.4186725Z  fi 2025-12-04T09:22:24.4186994Z fi 2025-12-04T09:22:24.4209540Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:24.4210034Z env: 2025-12-04T09:22:24.4210312Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:24.4210640Z ##[endgroup] 2025-12-04T09:22:24.4254444Z + diskspace_cutoff=70 2025-12-04T09:22:24.4259032Z ++ docker info -f '{{.DockerRootDir}}' 2025-12-04T09:22:24.4681033Z + docker_root_dir=/home/pytorchci/.local/share/docker 2025-12-04T09:22:24.4681644Z + '[' '!' -d /home/pytorchci/.local/share/docker ']' 2025-12-04T09:22:24.4687938Z ++ df -H --output=pcent /home/pytorchci/.local/share/docker 2025-12-04T09:22:24.4688406Z ++ sed -n 2p 2025-12-04T09:22:24.4689015Z ++ sed 's/ //' 2025-12-04T09:22:24.4689666Z ++ sed s/%// 2025-12-04T09:22:24.4709750Z + diskspace=13 2025-12-04T09:22:24.4710449Z + msg='Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified' 2025-12-04T09:22:24.4711178Z + [[ 13 -ge 70 ]] 2025-12-04T09:22:24.4733876Z ##[group]Run RUNNER_ARTIFACT_DIR="${RUNNER_TEMP}/artifacts" 2025-12-04T09:22:24.4734174Z RUNNER_ARTIFACT_DIR="${RUNNER_TEMP}/artifacts" 2025-12-04T09:22:24.4734408Z rm -rf "${RUNNER_ARTIFACT_DIR}" 2025-12-04T09:22:24.4734616Z mkdir -p "${RUNNER_ARTIFACT_DIR}" 2025-12-04T09:22:24.4734880Z echo "RUNNER_ARTIFACT_DIR=${RUNNER_ARTIFACT_DIR}" >> "${GITHUB_ENV}" 2025-12-04T09:22:24.4735134Z  2025-12-04T09:22:24.4735311Z RUNNER_TEST_RESULTS_DIR="${RUNNER_TEMP}/test-results" 2025-12-04T09:22:24.4735556Z rm -rf "${RUNNER_TEST_RESULTS_DIR}" 2025-12-04T09:22:24.4735765Z mkdir -p "${RUNNER_TEST_RESULTS_DIR}" 2025-12-04T09:22:24.4736063Z echo "RUNNER_TEST_RESULTS_DIR=${RUNNER_TEST_RESULTS_DIR}" >> "${GITHUB_ENV}" 2025-12-04T09:22:24.4736343Z  2025-12-04T09:22:24.4736484Z RUNNER_DOCS_DIR="${RUNNER_TEMP}/docs" 2025-12-04T09:22:24.4736676Z rm -rf "${RUNNER_DOCS_DIR}" 2025-12-04T09:22:24.4736864Z mkdir -p "${RUNNER_DOCS_DIR}" 2025-12-04T09:22:24.4737093Z echo "RUNNER_DOCS_DIR=${RUNNER_DOCS_DIR}" >> "${GITHUB_ENV}" 2025-12-04T09:22:24.4748171Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:24.4748405Z env: 2025-12-04T09:22:24.4748531Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:24.4748687Z ##[endgroup] 2025-12-04T09:22:24.4872871Z ##[group]Run env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:22:24.4873220Z env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:22:24.4873532Z env | grep '^CI' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:22:24.4883763Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:24.4883998Z env: 2025-12-04T09:22:24.4884286Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:24.4884539Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:24.4884903Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:24.4885249Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:24.4885490Z ##[endgroup] 2025-12-04T09:22:24.4970413Z ##[group]Run # All GPUs are visible to the runner; visibility, if needed, will be set by run_test.py. 2025-12-04T09:22:24.4970877Z # All GPUs are visible to the runner; visibility, if needed, will be set by run_test.py. 2025-12-04T09:22:24.4971189Z # Add render group for container creation. 2025-12-04T09:22:24.4971457Z render_gid=`cat /etc/group | grep render | cut -d: -f3` 2025-12-04T09:22:24.4971772Z # Ensure GPU isolation if pod is part of kubernetes setup with DEVICE_FLAG. 2025-12-04T09:22:24.4972110Z if [ -f "/etc/podinfo/gha-render-devices" ]; then 2025-12-04T09:22:24.4972377Z  DEVICE_FLAG=$(cat /etc/podinfo/gha-render-devices) 2025-12-04T09:22:24.4972596Z else 2025-12-04T09:22:24.4972747Z  DEVICE_FLAG="--device /dev/dri" 2025-12-04T09:22:24.4972929Z fi 2025-12-04T09:22:24.4973214Z # The --group-add daemon and --group-add bin are needed in the Ubuntu 24.04 and Almalinux OSs respectively. 2025-12-04T09:22:24.4973656Z # This is due to the device files (/dev/kfd & /dev/dri) being owned by video group on bare metal. 2025-12-04T09:22:24.4974073Z # This video group ID maps to subgid 1 inside the docker image due to the /etc/subgid entries. 2025-12-04T09:22:24.4974501Z # The group name corresponding to group ID 1 can change depending on the OS, so both are necessary. 2025-12-04T09:22:24.4975229Z echo "GPU_FLAG=--device=/dev/mem --device=/dev/kfd $DEVICE_FLAG --group-add video --group-add $render_gid --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host" >> "${GITHUB_ENV}" 2025-12-04T09:22:24.4987066Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:24.4987296Z env: 2025-12-04T09:22:24.4987429Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:24.4987686Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:24.4988054Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:24.4988384Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:24.4988621Z ##[endgroup] 2025-12-04T09:22:24.5080221Z ##[group]Run aws-actions/configure-aws-credentials@ececac1a45f3b08a01d2dd070d28d111c5fe6722 2025-12-04T09:22:24.5080517Z with: 2025-12-04T09:22:24.5080719Z role-to-assume: arn:aws:iam::308535385114:role/gha_workflow_s3_and_ecr_read_only 2025-12-04T09:22:24.5080991Z aws-region: us-east-1 2025-12-04T09:22:24.5081145Z role-duration-seconds: 18000 2025-12-04T09:22:24.5081325Z audience: sts.amazonaws.com 2025-12-04T09:22:24.5081478Z env: 2025-12-04T09:22:24.5081599Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:24.5081825Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:24.5082160Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:24.5082467Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:24.5083018Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:22:24.5083507Z ##[endgroup] 2025-12-04T09:22:24.7310170Z Assuming role with OIDC 2025-12-04T09:22:24.8890064Z Authenticated as assumedRoleId AROAUPVRELQNLLCOPFEJR:GitHubActions 2025-12-04T09:22:24.9447773Z ##[group]Run aws-actions/amazon-ecr-login@062b18b96a7aff071d4dc91bc00c4c1a7945b076 2025-12-04T09:22:24.9448082Z with: 2025-12-04T09:22:24.9448400Z mask-password: true 2025-12-04T09:22:24.9448568Z registry-type: private 2025-12-04T09:22:24.9448730Z skip-logout: false 2025-12-04T09:22:24.9448864Z env: 2025-12-04T09:22:24.9448997Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:24.9449248Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:24.9449611Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:24.9449966Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:24.9450571Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:22:24.9451126Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:22:24.9451310Z AWS_REGION: us-east-1 2025-12-04T09:22:24.9451739Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:22:24.9451991Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:22:24.9455447Z AWS_SESSION_TOKEN: *** 2025-12-04T09:22:24.9455613Z ##[endgroup] 2025-12-04T09:22:25.2129221Z Logging into registry 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:25.4765131Z ##[group]Run env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:22:25.4765964Z env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:22:25.4766819Z env | grep '^CI' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:22:25.4767535Z env | grep '^RUNNER' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:22:25.4793328Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:25.4793876Z env: 2025-12-04T09:22:25.4794205Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:25.4794809Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:25.4795994Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:25.4796825Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:25.4798230Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:22:25.4799514Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:22:25.4799939Z AWS_REGION: us-east-1 2025-12-04T09:22:25.4800498Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:22:25.4801055Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:22:25.4809409Z AWS_SESSION_TOKEN: *** 2025-12-04T09:22:25.4809795Z ##[endgroup] 2025-12-04T09:22:25.5011701Z ##[group]Run pytorch/test-infra/.github/actions/calculate-docker-image@main 2025-12-04T09:22:25.5012015Z with: 2025-12-04T09:22:25.5012461Z docker-image-name: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:25.5012974Z use-custom-docker-registry: true 2025-12-04T09:22:25.5013179Z docker-build-dir: .ci/docker 2025-12-04T09:22:25.5013369Z docker-build-script: ./build.sh 2025-12-04T09:22:25.5013566Z working-directory: . 2025-12-04T09:22:25.5013785Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:25.5014039Z force-push: false 2025-12-04T09:22:25.5014189Z env: 2025-12-04T09:22:25.5014334Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:25.5014597Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:25.5014974Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:25.5015345Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:25.5015939Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:22:25.5016491Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:22:25.5016675Z AWS_REGION: us-east-1 2025-12-04T09:22:25.5016951Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:22:25.5017191Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:22:25.5020660Z AWS_SESSION_TOKEN: *** 2025-12-04T09:22:25.5020824Z ##[endgroup] 2025-12-04T09:22:25.5034052Z ##[group]Run set -ex 2025-12-04T09:22:25.5034224Z set -ex 2025-12-04T09:22:25.5034359Z  2025-12-04T09:22:25.5034592Z # If the docker build directory or the build script doesn't exist, the action will 2025-12-04T09:22:25.5034977Z # gracefully return the docker image name as it is. Pulling docker image in Linux 2025-12-04T09:22:25.5035306Z # job could then download the pre-built image as usual 2025-12-04T09:22:25.5035703Z if [[ -d "${DOCKER_BUILD_DIR}" ]] && [[ -f "${DOCKER_BUILD_DIR}/${DOCKER_BUILD_SCRIPT}" ]] && [[ "${USE_CUSTOM_DOCKER_REGISTRY}" == "true" ]]; then 2025-12-04T09:22:25.5036077Z  echo "skip=false" >> "${GITHUB_OUTPUT}" 2025-12-04T09:22:25.5036278Z else 2025-12-04T09:22:25.5036437Z  echo "skip=true" >> "${GITHUB_OUTPUT}" 2025-12-04T09:22:25.5036695Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:22:25.5036934Z  2025-12-04T09:22:25.5037256Z  echo "Not using custom ECR registry. Either it was not requested or there is no Docker build script in the ${REPO_NAME} repo..." 2025-12-04T09:22:25.5037623Z  exit 0 2025-12-04T09:22:25.5037757Z fi 2025-12-04T09:22:25.5037880Z  2025-12-04T09:22:25.5038091Z if [[ "${DOCKER_IMAGE_NAME}" == *"${DOCKER_REGISTRY}/${REPO_NAME}"* ]]; then 2025-12-04T09:22:25.5038445Z  # The docker image name already includes the ECR prefix and tag, so we can just 2025-12-04T09:22:25.5038766Z  # use it as it is, but first let's extract the tag 2025-12-04T09:22:25.5039050Z  DOCKER_TAG=$(echo "${DOCKER_IMAGE_NAME}" | awk -F '[:,]' '{print $2}') 2025-12-04T09:22:25.5039493Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:22:25.5039776Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:22:25.5040014Z else 2025-12-04T09:22:25.5040176Z  if [[ "${DOCKER_IMAGE_NAME}" == *:* ]]; then 2025-12-04T09:22:25.5040405Z  CUSTOM_TAG_PREFIX=${DOCKER_IMAGE_NAME#*:} 2025-12-04T09:22:25.5040631Z  DOCKER_IMAGE_NAME=${DOCKER_IMAGE_NAME%%:*} 2025-12-04T09:22:25.5040824Z  fi 2025-12-04T09:22:25.5041195Z  DOCKER_TAG=${CUSTOM_TAG_PREFIX:+${CUSTOM_TAG_PREFIX}-}$(git rev-parse HEAD:"${DOCKER_BUILD_DIR}") 2025-12-04T09:22:25.5041549Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:22:25.5041913Z  echo "docker-image=${DOCKER_REGISTRY}/${REPO_NAME}/${DOCKER_IMAGE_NAME}:${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:22:25.5042317Z  echo "custom-tag-prefix=${CUSTOM_TAG_PREFIX}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:22:25.5042565Z fi 2025-12-04T09:22:25.5054312Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:25.5054556Z env: 2025-12-04T09:22:25.5054699Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:25.5054963Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:25.5055334Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:25.5055676Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:25.5056272Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:22:25.5056813Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:22:25.5056995Z AWS_REGION: us-east-1 2025-12-04T09:22:25.5057206Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:22:25.5057472Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:22:25.5060909Z AWS_SESSION_TOKEN: *** 2025-12-04T09:22:25.5061077Z REPO_NAME: pytorch 2025-12-04T09:22:25.5061522Z DOCKER_IMAGE_NAME: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:25.5062000Z DOCKER_BUILD_DIR: .ci/docker 2025-12-04T09:22:25.5062181Z DOCKER_BUILD_SCRIPT: ./build.sh 2025-12-04T09:22:25.5062422Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:25.5062678Z USE_CUSTOM_DOCKER_REGISTRY: true 2025-12-04T09:22:25.5062865Z CUSTOM_TAG_PREFIX: 2025-12-04T09:22:25.5063020Z ##[endgroup] 2025-12-04T09:22:25.5092316Z + [[ -d .ci/docker ]] 2025-12-04T09:22:25.5092504Z + [[ -f .ci/docker/./build.sh ]] 2025-12-04T09:22:25.5092695Z + [[ true == \t\r\u\e ]] 2025-12-04T09:22:25.5092859Z + echo skip=false 2025-12-04T09:22:25.5095656Z + [[ 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a == *\3\0\8\5\3\5\3\8\5\1\1\4\.\d\k\r\.\e\c\r\.\u\s\-\e\a\s\t\-\1\.\a\m\a\z\o\n\a\w\s\.\c\o\m\/\p\y\t\o\r\c\h* ]] 2025-12-04T09:22:25.5103025Z ++ echo 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:25.5105161Z ++ awk -F '[:,]' '{print $2}' 2025-12-04T09:22:25.5124941Z + DOCKER_TAG=pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:25.5126013Z + echo docker-tag=pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:25.5127867Z + echo docker-image=308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:25.5175599Z ##[group]Run set +e 2025-12-04T09:22:25.5175899Z set +e 2025-12-04T09:22:25.5176134Z set -x 2025-12-04T09:22:25.5176607Z  2025-12-04T09:22:25.5176816Z login() { 2025-12-04T09:22:25.5177300Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2025-12-04T09:22:25.5177805Z } 2025-12-04T09:22:25.5178011Z  2025-12-04T09:22:25.5178224Z retry () { 2025-12-04T09:22:25.5178492Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2025-12-04T09:22:25.5178799Z } 2025-12-04T09:22:25.5179006Z  2025-12-04T09:22:25.5179235Z retry login "${DOCKER_REGISTRY}" 2025-12-04T09:22:25.5179537Z  2025-12-04T09:22:25.5179755Z START_TIME=$(date +%s) 2025-12-04T09:22:25.5180052Z # Wait up to 120 minutes 2025-12-04T09:22:25.5180629Z while [[ $(( $(date +%s) - 7200 )) -lt $START_TIME ]]; do 2025-12-04T09:22:25.5181120Z  # Check if image already exists, if it does then skip building it 2025-12-04T09:22:25.5181615Z  if docker manifest inspect "${DOCKER_IMAGE}"; then 2025-12-04T09:22:25.5181979Z  exit 0 2025-12-04T09:22:25.5182214Z  fi 2025-12-04T09:22:25.5182435Z  2025-12-04T09:22:25.5182813Z  # NB: This flag is used by Docker build workflow to push the image to ECR, so we can 2025-12-04T09:22:25.5183450Z  # use this to differentiate between the Docker build and regular build jobs. For the 2025-12-04T09:22:25.5184081Z  # latter, it will wait for the Docker images to become available before continuing 2025-12-04T09:22:25.5184586Z  if [ "${DOCKER_PUSH:-false}" == "true" ]; then 2025-12-04T09:22:25.5184998Z  # It's a Docker build job, let's build the image 2025-12-04T09:22:25.5185349Z  break 2025-12-04T09:22:25.5185585Z  else 2025-12-04T09:22:25.5185925Z  # It's a regular build job, wait for the image to become available 2025-12-04T09:22:25.5186335Z  sleep 300 2025-12-04T09:22:25.5186585Z  fi 2025-12-04T09:22:25.5186811Z done 2025-12-04T09:22:25.5187030Z  2025-12-04T09:22:25.5187387Z # NB: This part requires a full checkout. Otherwise, the merge base will 2025-12-04T09:22:25.5187942Z # be empty. The default action would be to continue rebuild the image 2025-12-04T09:22:25.5188444Z if [[ "$BASE_REVISION" = "$(git rev-parse HEAD)" ]]; then 2025-12-04T09:22:25.5188891Z  # if we're on the base branch then use the parent commit 2025-12-04T09:22:25.5189289Z  MERGE_BASE=$(git rev-parse HEAD~) 2025-12-04T09:22:25.5189600Z else 2025-12-04T09:22:25.5189915Z  # otherwise we're on a PR, so use the most recent base commit 2025-12-04T09:22:25.5190383Z  MERGE_BASE=$(git merge-base HEAD "$BASE_REVISION") 2025-12-04T09:22:25.5190740Z fi 2025-12-04T09:22:25.5190952Z  2025-12-04T09:22:25.5191186Z if [[ -z "${MERGE_BASE}" ]]; then 2025-12-04T09:22:25.5191536Z  echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2025-12-04T09:22:25.5191870Z  2025-12-04T09:22:25.5192316Z  echo "Finding merge base only works with full checkout, please set fetch-depth to 0, continuing ..." 2025-12-04T09:22:25.5192841Z  exit 0 2025-12-04T09:22:25.5193059Z fi 2025-12-04T09:22:25.5193265Z  2025-12-04T09:22:25.5193567Z if ! git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}"; then 2025-12-04T09:22:25.5194221Z  echo "Directory '${DOCKER_BUILD_DIR}' not found in commit $MERGE_BASE, you should rebase onto a more recent commit" 2025-12-04T09:22:25.5194775Z  exit 1 2025-12-04T09:22:25.5194998Z fi 2025-12-04T09:22:25.5195210Z  2025-12-04T09:22:25.5195565Z PREVIOUS_DOCKER_TAG=$(git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}") 2025-12-04T09:22:25.5196194Z # If no image exists but the hash is the same as the previous hash then we should error out here 2025-12-04T09:22:25.5196908Z if [[ "${PREVIOUS_DOCKER_TAG}" == "${DOCKER_TAG}" ]]; then 2025-12-04T09:22:25.5197565Z  echo "WARNING: Something has gone wrong and the previous image isn't available for the merge-base of your branch" 2025-12-04T09:22:25.5198302Z  echo " Will re-build docker image to store in local cache, TTS may be longer" 2025-12-04T09:22:25.5198739Z fi 2025-12-04T09:22:25.5198940Z  2025-12-04T09:22:25.5199203Z echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2025-12-04T09:22:25.5217002Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:25.5217375Z env: 2025-12-04T09:22:25.5217606Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:25.5218198Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:25.5218790Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:25.5219352Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:25.5220317Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:22:25.5221183Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:22:25.5221472Z AWS_REGION: us-east-1 2025-12-04T09:22:25.5221878Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:22:25.5222261Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:22:25.5227712Z AWS_SESSION_TOKEN: *** 2025-12-04T09:22:25.5227995Z DOCKER_BUILD_DIR: .ci/docker 2025-12-04T09:22:25.5228343Z BASE_REVISION: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:22:25.5229163Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:25.5230088Z DOCKER_TAG: pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:25.5230678Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:25.5231072Z DOCKER_PUSH: 2025-12-04T09:22:25.5231306Z ##[endgroup] 2025-12-04T09:22:25.5271123Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:25.5271415Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:25.5276667Z + aws ecr get-login-password --region us-east-1 2025-12-04T09:22:25.5277549Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:26.3808144Z 2025-12-04T09:22:26.3808394Z Login Succeeded 2025-12-04T09:22:26.3809199Z WARNING! Your credentials are stored unencrypted in '/home/pytorchci/.docker/config.json'. 2025-12-04T09:22:26.3810085Z Configure a credential helper to remove this warning. See 2025-12-04T09:22:26.3810755Z https://docs.docker.com/go/credential-store/ 2025-12-04T09:22:26.3811124Z 2025-12-04T09:22:26.3842435Z ++ date +%s 2025-12-04T09:22:26.3851787Z + START_TIME=1764840146 2025-12-04T09:22:26.3856352Z ++ date +%s 2025-12-04T09:22:26.3869639Z + [[ 1764832946 -lt 1764840146 ]] 2025-12-04T09:22:26.3870940Z + docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:26.9944737Z { 2025-12-04T09:22:26.9945152Z "schemaVersion": 2, 2025-12-04T09:22:26.9945827Z "mediaType": "application/vnd.docker.distribution.manifest.v2+json", 2025-12-04T09:22:26.9946485Z "config": { 2025-12-04T09:22:26.9946978Z "mediaType": "application/vnd.docker.container.image.v1+json", 2025-12-04T09:22:26.9947559Z "size": 30520, 2025-12-04T09:22:26.9948147Z "digest": "sha256:45252333063339f104d56e41f20304e9511ab21c7768e8d156b95ddf24a9dbe5" 2025-12-04T09:22:26.9948802Z }, 2025-12-04T09:22:26.9949124Z "layers": [ 2025-12-04T09:22:26.9949427Z { 2025-12-04T09:22:26.9949901Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9950518Z "size": 30447951, 2025-12-04T09:22:26.9951123Z "digest": "sha256:63e5bc7682b85ae57a1221210f64d62e7a90b0a30f19af4ca734b8242ae49d63" 2025-12-04T09:22:26.9952355Z }, 2025-12-04T09:22:26.9952645Z { 2025-12-04T09:22:26.9953111Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9953689Z "size": 1554, 2025-12-04T09:22:26.9954266Z "digest": "sha256:835841cca3b7e1464290cdb78e48773e03583413fbed852c3cc5165a392ea44d" 2025-12-04T09:22:26.9954909Z }, 2025-12-04T09:22:26.9955183Z { 2025-12-04T09:22:26.9955635Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9956205Z "size": 313275691, 2025-12-04T09:22:26.9956735Z "digest": "sha256:aac69780afc8611a5f94a235792d39ae055249c8319ef43b78675998a9b2f825" 2025-12-04T09:22:26.9957073Z }, 2025-12-04T09:22:26.9957361Z { 2025-12-04T09:22:26.9957607Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9957903Z "size": 704, 2025-12-04T09:22:26.9958204Z "digest": "sha256:029495b23122c840ca0e52d487afa8d2c4dbf1991cd7f204ec3e434dcf947bf4" 2025-12-04T09:22:26.9958551Z }, 2025-12-04T09:22:26.9958697Z { 2025-12-04T09:22:26.9958936Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9959286Z "size": 1218, 2025-12-04T09:22:26.9959595Z "digest": "sha256:d0fb85b008332051a3f7c052721ef68bde404b46c23fa43ad040373bd367826c" 2025-12-04T09:22:26.9959938Z }, 2025-12-04T09:22:26.9960083Z { 2025-12-04T09:22:26.9960324Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9960630Z "size": 484, 2025-12-04T09:22:26.9960930Z "digest": "sha256:59b63930883363c7d2aaab27cc61555d9f3e119dc18247a8624c98ebdaa354a5" 2025-12-04T09:22:26.9961269Z }, 2025-12-04T09:22:26.9961411Z { 2025-12-04T09:22:26.9961652Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9961957Z "size": 110363202, 2025-12-04T09:22:26.9962271Z "digest": "sha256:dc112c89d57aa1e85082e40a56e5bc743d64f834ae2f98afe91f60c248354d38" 2025-12-04T09:22:26.9962611Z }, 2025-12-04T09:22:26.9962760Z { 2025-12-04T09:22:26.9962995Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9963293Z "size": 4436, 2025-12-04T09:22:26.9963587Z "digest": "sha256:522eab2402e5001810155ef7eb56940b7c01a4fef62ac588886981c3b8ee8e1e" 2025-12-04T09:22:26.9963925Z }, 2025-12-04T09:22:26.9964068Z { 2025-12-04T09:22:26.9964301Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9964598Z "size": 1755, 2025-12-04T09:22:26.9964893Z "digest": "sha256:2b5a11b41761d8ea3b829e4772e4064cb6c4e4989126af324d0057661e4493a1" 2025-12-04T09:22:26.9965228Z }, 2025-12-04T09:22:26.9965373Z { 2025-12-04T09:22:26.9965615Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9965912Z "size": 724, 2025-12-04T09:22:26.9966456Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T09:22:26.9967109Z }, 2025-12-04T09:22:26.9967370Z { 2025-12-04T09:22:26.9967817Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9968384Z "size": 3185588166, 2025-12-04T09:22:26.9968964Z "digest": "sha256:73e33534e9eb94cf29418d65944168962b65fe21f55e9b8bad18c76e9b3a37b8" 2025-12-04T09:22:26.9969595Z }, 2025-12-04T09:22:26.9969858Z { 2025-12-04T09:22:26.9970285Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9970836Z "size": 396, 2025-12-04T09:22:26.9971406Z "digest": "sha256:5bfdaeb5578d6ffcd7db29c48303cbceb13c591210feaa216a8daa7a6d445b4b" 2025-12-04T09:22:26.9972064Z }, 2025-12-04T09:22:26.9972328Z { 2025-12-04T09:22:26.9972766Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9973336Z "size": 236863, 2025-12-04T09:22:26.9973929Z "digest": "sha256:c07d27e4d3a5ba4ad5325bb785b2e4f058fe5e10ec1aeeb413a1e152b073f203" 2025-12-04T09:22:26.9974586Z }, 2025-12-04T09:22:26.9974847Z { 2025-12-04T09:22:26.9975287Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9976127Z "size": 787, 2025-12-04T09:22:26.9976684Z "digest": "sha256:b21856d1bf420da6fa8ec7331b82ab355d4f4178644e7d3a3d3d0fbc3610109a" 2025-12-04T09:22:26.9977334Z }, 2025-12-04T09:22:26.9977602Z { 2025-12-04T09:22:26.9978039Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9978598Z "size": 106, 2025-12-04T09:22:26.9979155Z "digest": "sha256:cb19d84867e4063f55db9459c28c50a2abc37c06d3c1ca82ba95fa8427cc438a" 2025-12-04T09:22:26.9979933Z }, 2025-12-04T09:22:26.9980256Z { 2025-12-04T09:22:26.9980707Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9981310Z "size": 1496, 2025-12-04T09:22:26.9982200Z "digest": "sha256:8165374f8dccf88a7791a5d31afbe29e4d4542b4f1cf1904945e07f9af6bf8ba" 2025-12-04T09:22:26.9982891Z }, 2025-12-04T09:22:26.9983167Z { 2025-12-04T09:22:26.9983642Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9984275Z "size": 458789560, 2025-12-04T09:22:26.9984887Z "digest": "sha256:1aecc77354ceba59ec6f0d37a558f2dbb6d5c0854553ee8505ac8707b422da6d" 2025-12-04T09:22:26.9985545Z }, 2025-12-04T09:22:26.9985814Z { 2025-12-04T09:22:26.9986266Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9986876Z "size": 164, 2025-12-04T09:22:26.9987455Z "digest": "sha256:465d3fd643aa2ea0ad07335cda66f12f1d7e5e800c4e9385ec466bc8a1ceabda" 2025-12-04T09:22:26.9988030Z }, 2025-12-04T09:22:26.9988172Z { 2025-12-04T09:22:26.9988414Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9988720Z "size": 104, 2025-12-04T09:22:26.9989025Z "digest": "sha256:6c503e779d6f41ca7f51309875df2b725c171926aece7009c4b8a64d1ba3f58e" 2025-12-04T09:22:26.9989369Z }, 2025-12-04T09:22:26.9989515Z { 2025-12-04T09:22:26.9989750Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9990048Z "size": 724, 2025-12-04T09:22:26.9990341Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T09:22:26.9990673Z }, 2025-12-04T09:22:26.9990815Z { 2025-12-04T09:22:26.9991048Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9991350Z "size": 196, 2025-12-04T09:22:26.9991644Z "digest": "sha256:f7e9a021f0ee3d11a50dcb96378af8103a21f6c3c142f54529207648f3ed00b2" 2025-12-04T09:22:26.9991983Z }, 2025-12-04T09:22:26.9992128Z { 2025-12-04T09:22:26.9992363Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9992669Z "size": 2583, 2025-12-04T09:22:26.9992971Z "digest": "sha256:8e023b349080fb11ee55491bc9b842b30e9e3a90246d05b303a73dc62038caf2" 2025-12-04T09:22:26.9993314Z }, 2025-12-04T09:22:26.9993457Z { 2025-12-04T09:22:26.9993694Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9994000Z "size": 7577171420, 2025-12-04T09:22:26.9994308Z "digest": "sha256:8188df80e595a3dbcf84623c6a58a655269898cbb60029435f136d7f9d34ccaa" 2025-12-04T09:22:26.9994652Z }, 2025-12-04T09:22:26.9994796Z { 2025-12-04T09:22:26.9995032Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9995331Z "size": 135, 2025-12-04T09:22:26.9995640Z "digest": "sha256:3c2c2f8c74bfa16c4bf9a832c97bbb1d55205b2b4a2cead02cf74301ca1001fb" 2025-12-04T09:22:26.9995987Z }, 2025-12-04T09:22:26.9996133Z { 2025-12-04T09:22:26.9996400Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9996881Z "size": 104, 2025-12-04T09:22:26.9997350Z "digest": "sha256:2aa7784fbe3300f8bbfb6bb51cff3b01fd091e829c2bc7ab9e25261a0dd9b3bd" 2025-12-04T09:22:26.9997907Z }, 2025-12-04T09:22:26.9998132Z { 2025-12-04T09:22:26.9998499Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:26.9998972Z "size": 612, 2025-12-04T09:22:26.9999438Z "digest": "sha256:2b3b5215d3ebe8789f0444457bfd5a6e218289b64aa07653ac3d03ddda5e6708" 2025-12-04T09:22:27.0000160Z }, 2025-12-04T09:22:27.0000381Z { 2025-12-04T09:22:27.0000751Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0001225Z "size": 838191945, 2025-12-04T09:22:27.0001723Z "digest": "sha256:99b1f1ea3e857834cebd01763d90fbd700aeb9c2d2ef23eda2cfff5652c9708b" 2025-12-04T09:22:27.0002266Z }, 2025-12-04T09:22:27.0002493Z { 2025-12-04T09:22:27.0002857Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0003328Z "size": 111, 2025-12-04T09:22:27.0003800Z "digest": "sha256:18d6daba0a5768a37ad106b57974f6b7efd35c43a87c246bcd3f43fea88f2d2b" 2025-12-04T09:22:27.0004342Z }, 2025-12-04T09:22:27.0004811Z { 2025-12-04T09:22:27.0005181Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0005651Z "size": 1555, 2025-12-04T09:22:27.0006127Z "digest": "sha256:5277f2a503ebd17ba9d9b86cc9bac86265504adeb449c0647616ddaacd3cbc41" 2025-12-04T09:22:27.0006832Z }, 2025-12-04T09:22:27.0007107Z { 2025-12-04T09:22:27.0007546Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0008108Z "size": 107, 2025-12-04T09:22:27.0008659Z "digest": "sha256:3198a9717aace920fd5de085319adf75091af05fc4318ce4b16a8a5b0e8d449e" 2025-12-04T09:22:27.0009302Z }, 2025-12-04T09:22:27.0009570Z { 2025-12-04T09:22:27.0010012Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0010573Z "size": 166, 2025-12-04T09:22:27.0011118Z "digest": "sha256:99a4918e5808277879449e97ccd7190db6b9aa2d742b57a3b831ce0198522bdd" 2025-12-04T09:22:27.0011755Z }, 2025-12-04T09:22:27.0012020Z { 2025-12-04T09:22:27.0012463Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0013033Z "size": 3526081, 2025-12-04T09:22:27.0013606Z "digest": "sha256:15bb11dfc6acc3537d527d6771c8e711e5605e99f82ec41e805d4600b8a97516" 2025-12-04T09:22:27.0014246Z }, 2025-12-04T09:22:27.0014520Z { 2025-12-04T09:22:27.0014962Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0015527Z "size": 107, 2025-12-04T09:22:27.0016090Z "digest": "sha256:bd87c8766e90e33db17514558ac591cc3f4149afd7abeaef4dd5770bbfa14210" 2025-12-04T09:22:27.0016745Z }, 2025-12-04T09:22:27.0017010Z { 2025-12-04T09:22:27.0017448Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0018012Z "size": 829, 2025-12-04T09:22:27.0018571Z "digest": "sha256:1969e15d0c13874ea5883ed829235a19ef6dc21c8aa6172032b78a8ffa6ff262" 2025-12-04T09:22:27.0019210Z }, 2025-12-04T09:22:27.0019473Z { 2025-12-04T09:22:27.0019923Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0020502Z "size": 26973054, 2025-12-04T09:22:27.0021089Z "digest": "sha256:24a03847d382b73c11969f8f73916a6bedf5ccea12f6f4290b3880f29ceda32a" 2025-12-04T09:22:27.0021734Z }, 2025-12-04T09:22:27.0021997Z { 2025-12-04T09:22:27.0022439Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0023002Z "size": 104, 2025-12-04T09:22:27.0023567Z "digest": "sha256:816e2e34e01839a35d624dbf4bd9ac9bea4c975104af47a0e6b6b6dee6c6f98d" 2025-12-04T09:22:27.0024210Z }, 2025-12-04T09:22:27.0024477Z { 2025-12-04T09:22:27.0024916Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0025481Z "size": 424, 2025-12-04T09:22:27.0026034Z "digest": "sha256:b168858b85373f8ddca549d79267a06de4fa945d04bf791c55c9ddc93957fa3c" 2025-12-04T09:22:27.0026683Z }, 2025-12-04T09:22:27.0026948Z { 2025-12-04T09:22:27.0027384Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0027956Z "size": 19309386, 2025-12-04T09:22:27.0028548Z "digest": "sha256:6b8d5ff02e267e38322afbb8a58ed63ce9d75b10e9e73255e6affcbc6b6539bf" 2025-12-04T09:22:27.0029200Z }, 2025-12-04T09:22:27.0029458Z { 2025-12-04T09:22:27.0029893Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0030725Z "size": 826, 2025-12-04T09:22:27.0031285Z "digest": "sha256:4e3b10a5dd6aed29f238d604925e2a4f873141c1087c8dd4fdde5c61e7560893" 2025-12-04T09:22:27.0031928Z }, 2025-12-04T09:22:27.0032196Z { 2025-12-04T09:22:27.0032632Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0033192Z "size": 724, 2025-12-04T09:22:27.0033740Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T09:22:27.0034367Z }, 2025-12-04T09:22:27.0034630Z { 2025-12-04T09:22:27.0035074Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0035641Z "size": 149, 2025-12-04T09:22:27.0036439Z "digest": "sha256:3092fab73b59190b9facfc49bf18f58612172bc2fd68dfa339a1118632616939" 2025-12-04T09:22:27.0037089Z }, 2025-12-04T09:22:27.0037355Z { 2025-12-04T09:22:27.0037795Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0038371Z "size": 136, 2025-12-04T09:22:27.0038944Z "digest": "sha256:20020dd28a15ba092fcbfe906ee39cdddfcc9d0b7eb42fdd6f4c08a984fa9c00" 2025-12-04T09:22:27.0039600Z }, 2025-12-04T09:22:27.0039863Z { 2025-12-04T09:22:27.0040301Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0040865Z "size": 140, 2025-12-04T09:22:27.0041428Z "digest": "sha256:ae5280ce969dcff08c091e9a5f7641f13561b2b0ee44d78b7c3f81d8fe8e6d32" 2025-12-04T09:22:27.0042076Z }, 2025-12-04T09:22:27.0042340Z { 2025-12-04T09:22:27.0042779Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0043347Z "size": 32, 2025-12-04T09:22:27.0043922Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:22:27.0044573Z }, 2025-12-04T09:22:27.0044836Z { 2025-12-04T09:22:27.0045277Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0045836Z "size": 222, 2025-12-04T09:22:27.0046500Z "digest": "sha256:fe17d9eb0fd26d3af4c724bf570d833978b131cedb7dc17a800aa388a246b3cd" 2025-12-04T09:22:27.0047157Z }, 2025-12-04T09:22:27.0047434Z { 2025-12-04T09:22:27.0047872Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0048433Z "size": 346, 2025-12-04T09:22:27.0048978Z "digest": "sha256:a51e0dab2d596e6563483f27c12660007160847d177ba4c31812a8f44ada5754" 2025-12-04T09:22:27.0049615Z }, 2025-12-04T09:22:27.0049881Z { 2025-12-04T09:22:27.0050318Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0050881Z "size": 88300, 2025-12-04T09:22:27.0051467Z "digest": "sha256:6eb176cefd72d37ecbcdf074289a8f1de732d8816cc695ece7e4709d098094d6" 2025-12-04T09:22:27.0052120Z }, 2025-12-04T09:22:27.0052385Z { 2025-12-04T09:22:27.0052829Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0053385Z "size": 106, 2025-12-04T09:22:27.0053944Z "digest": "sha256:e7b8cf2e8d5a4c56db9726ce62c1176032408b3b1c25a000592361cb4245e2b5" 2025-12-04T09:22:27.0054607Z }, 2025-12-04T09:22:27.0054878Z { 2025-12-04T09:22:27.0055341Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0055919Z "size": 1671, 2025-12-04T09:22:27.0056504Z "digest": "sha256:ef3a5060abce88884bc8bd815aa41c46427f34eeb132fe0ddd85a3f86e6dc83d" 2025-12-04T09:22:27.0057167Z }, 2025-12-04T09:22:27.0057437Z { 2025-12-04T09:22:27.0057888Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0058467Z "size": 724, 2025-12-04T09:22:27.0059023Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T09:22:27.0059653Z }, 2025-12-04T09:22:27.0059921Z { 2025-12-04T09:22:27.0060370Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0060940Z "size": 138, 2025-12-04T09:22:27.0061514Z "digest": "sha256:a6f4ec14b42b8f0a83d20aa6a985ddb6a1bf64e0ed3d44afd3484b87d4ed5ad3" 2025-12-04T09:22:27.0062439Z }, 2025-12-04T09:22:27.0062700Z { 2025-12-04T09:22:27.0063142Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0063717Z "size": 119, 2025-12-04T09:22:27.0064282Z "digest": "sha256:7e5a0c956cfbd6f8074fbfd3b1d416e6635d632835ec00c8dd4c015a21da19b4" 2025-12-04T09:22:27.0064928Z }, 2025-12-04T09:22:27.0065197Z { 2025-12-04T09:22:27.0065642Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0066216Z "size": 6238423049, 2025-12-04T09:22:27.0066819Z "digest": "sha256:b4f78730cfe76ce091b78b2e2e3d52be03f1097b3e4c3de5bd79f8d13a853132" 2025-12-04T09:22:27.0067469Z }, 2025-12-04T09:22:27.0067734Z { 2025-12-04T09:22:27.0068425Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0069005Z "size": 174, 2025-12-04T09:22:27.0069549Z "digest": "sha256:081028f24389b112683689fd362e8c0d6f358082710e72feab91cea6383feb4d" 2025-12-04T09:22:27.0070185Z }, 2025-12-04T09:22:27.0070447Z { 2025-12-04T09:22:27.0070887Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0071456Z "size": 1896, 2025-12-04T09:22:27.0072044Z "digest": "sha256:a534dcf4b9a9e5fabed742c8a8fc43c9cfe7346ea88ab3c177c3b14fd3afe00a" 2025-12-04T09:22:27.0072700Z }, 2025-12-04T09:22:27.0072964Z { 2025-12-04T09:22:27.0073400Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0073968Z "size": 197577597, 2025-12-04T09:22:27.0074547Z "digest": "sha256:2e77500302cc13224427e1d74e471bd79d5109ba6a5099a83df1d10b786f71ba" 2025-12-04T09:22:27.0075180Z }, 2025-12-04T09:22:27.0075445Z { 2025-12-04T09:22:27.0075894Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0076457Z "size": 304, 2025-12-04T09:22:27.0077027Z "digest": "sha256:bc08246bb4ba18c3ec5bc69e16b6b4e929c5bd0f3fae10eeb0b1a622a63d6fa2" 2025-12-04T09:22:27.0077682Z }, 2025-12-04T09:22:27.0077957Z { 2025-12-04T09:22:27.0078396Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0078961Z "size": 32, 2025-12-04T09:22:27.0079529Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:22:27.0080180Z }, 2025-12-04T09:22:27.0080441Z { 2025-12-04T09:22:27.0080888Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0081453Z "size": 106, 2025-12-04T09:22:27.0082016Z "digest": "sha256:ff0c473ca120ebdcaa2ba10b3274e82032edd5196019e76d4e7584553704ae81" 2025-12-04T09:22:27.0082665Z }, 2025-12-04T09:22:27.0082935Z { 2025-12-04T09:22:27.0083386Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:22:27.0083955Z "size": 54145662, 2025-12-04T09:22:27.0084546Z "digest": "sha256:6bbc14b250efb3cdaad12c91573c6bb9129ad3e3432f0ed1a7eaebc9958d162f" 2025-12-04T09:22:27.0085203Z } 2025-12-04T09:22:27.0085469Z ] 2025-12-04T09:22:27.0085744Z } 2025-12-04T09:22:27.0086047Z + exit 0 2025-12-04T09:22:27.0129228Z ##[group]Run set -eux 2025-12-04T09:22:27.0129632Z set -eux 2025-12-04T09:22:27.0130252Z # It's ok if this steps fails, it would then be an anonymous user like what we used to have 2025-12-04T09:22:27.0131881Z aws secretsmanager get-secret-value --secret-id docker_hub_readonly_token | jq --raw-output '.SecretString' | jq -r .docker_hub_readonly_token | docker login --username pytorchbot --password-stdin || true 2025-12-04T09:22:27.0158466Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:27.0159029Z env: 2025-12-04T09:22:27.0159357Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:27.0159972Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:27.0160848Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:27.0161649Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:27.0163339Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:22:27.0164618Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:22:27.0165033Z AWS_REGION: us-east-1 2025-12-04T09:22:27.0165614Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:22:27.0166269Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:22:27.0174511Z AWS_SESSION_TOKEN: *** 2025-12-04T09:22:27.0174894Z ##[endgroup] 2025-12-04T09:22:27.0234704Z + aws secretsmanager get-secret-value --secret-id docker_hub_readonly_token 2025-12-04T09:22:27.0236247Z + jq --raw-output .SecretString 2025-12-04T09:22:27.0237264Z + jq -r .docker_hub_readonly_token 2025-12-04T09:22:27.0239804Z + docker login --username pytorchbot --password-stdin 2025-12-04T09:22:27.5833104Z 2025-12-04T09:22:27.5835583Z An error occurred (AccessDeniedException) when calling the GetSecretValue operation: User: arn:aws:sts::308535385114:assumed-role/gha_workflow_s3_and_ecr_read_only/GitHubActions is not authorized to perform: secretsmanager:GetSecretValue on resource: docker_hub_readonly_token because no identity-based policy allows the secretsmanager:GetSecretValue action 2025-12-04T09:22:27.6514714Z error: cannot perform an interactive login from a non TTY device 2025-12-04T09:22:27.6533375Z + true 2025-12-04T09:22:27.6680790Z ##[group]Run pytorch/test-infra/.github/actions/pull-docker-image@main 2025-12-04T09:22:27.6681458Z with: 2025-12-04T09:22:27.6682470Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:27.6683719Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:27.6684271Z env: 2025-12-04T09:22:27.6684586Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:27.6685179Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:27.6686033Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:27.6686986Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:27.6688452Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:22:27.6689741Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:22:27.6690151Z AWS_REGION: us-east-1 2025-12-04T09:22:27.6690619Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:22:27.6691163Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:22:27.6699279Z AWS_SESSION_TOKEN: *** 2025-12-04T09:22:27.6699655Z ##[endgroup] 2025-12-04T09:22:27.6724745Z ##[group]Run set -x 2025-12-04T09:22:27.6725144Z set -x 2025-12-04T09:22:27.6725477Z set +e 2025-12-04T09:22:27.6725794Z  2025-12-04T09:22:27.6726109Z login() { 2025-12-04T09:22:27.6726942Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2025-12-04T09:22:27.6727711Z } 2025-12-04T09:22:27.6728021Z  2025-12-04T09:22:27.6728331Z retry () { 2025-12-04T09:22:27.6728730Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2025-12-04T09:22:27.6729181Z } 2025-12-04T09:22:27.6729480Z  2025-12-04T09:22:27.6729824Z retry login "${DOCKER_REGISTRY}" 2025-12-04T09:22:27.6730278Z  2025-12-04T09:22:27.6730997Z IMAGE_SIZE=$(docker manifest inspect "${DOCKER_IMAGE}" | jq '[.layers[].size, .config.size] | add / 1024 / 1024') 2025-12-04T09:22:27.6731945Z echo "Compressed size of image in MB: ${IMAGE_SIZE}" 2025-12-04T09:22:27.6732479Z  2025-12-04T09:22:27.6732782Z set -e 2025-12-04T09:22:27.6733284Z # ignore output since only exit code is used for conditional 2025-12-04T09:22:27.6733992Z # only pull docker image if it's not available locally 2025-12-04T09:22:27.6735098Z if ! docker inspect --type=image "${DOCKER_IMAGE}" >/dev/null 2>/dev/null; then 2025-12-04T09:22:27.6735818Z  retry docker pull "${DOCKER_IMAGE}" 2025-12-04T09:22:27.6736276Z fi 2025-12-04T09:22:27.6760597Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:22:27.6761149Z env: 2025-12-04T09:22:27.6761485Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:22:27.6762093Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:22:27.6762964Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:22:27.6763779Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:22:27.6765188Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:22:27.6766553Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:22:27.6766981Z AWS_REGION: us-east-1 2025-12-04T09:22:27.6767472Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:22:27.6768022Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:22:27.6776231Z AWS_SESSION_TOKEN: *** 2025-12-04T09:22:27.6777644Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:27.6778918Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:27.6779484Z ##[endgroup] 2025-12-04T09:22:27.6832539Z + set +e 2025-12-04T09:22:27.6833091Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:27.6833743Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:27.6838724Z + aws ecr get-login-password --region us-east-1 2025-12-04T09:22:27.6840918Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:22:28.5482641Z 2025-12-04T09:22:28.5483188Z Login Succeeded 2025-12-04T09:22:28.5484048Z WARNING! Your credentials are stored unencrypted in '/home/pytorchci/.docker/config.json'. 2025-12-04T09:22:28.5484986Z Configure a credential helper to remove this warning. See 2025-12-04T09:22:28.5485646Z https://docs.docker.com/go/credential-store/ 2025-12-04T09:22:28.5486008Z 2025-12-04T09:22:28.5530456Z ++ docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:28.5531786Z ++ jq '[.layers[].size, .config.size] | add / 1024 / 1024' 2025-12-04T09:22:29.1658804Z + IMAGE_SIZE=18171.470620155334 2025-12-04T09:22:29.1659316Z Compressed size of image in MB: 18171.470620155334 2025-12-04T09:22:29.1659857Z + echo 'Compressed size of image in MB: 18171.470620155334' 2025-12-04T09:22:29.1660313Z + set -e 2025-12-04T09:22:29.1661250Z + docker inspect --type=image 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:29.1794503Z + retry docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:29.1795911Z + docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:22:29.3992204Z pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a: Pulling from pytorch/ci-image 2025-12-04T09:22:29.8119672Z aac69780afc8: Pulling fs layer 2025-12-04T09:22:29.8120160Z 522eab2402e5: Pulling fs layer 2025-12-04T09:22:29.8120527Z 59b639308833: Pulling fs layer 2025-12-04T09:22:29.8120814Z 6bbc14b250ef: Pulling fs layer 2025-12-04T09:22:29.8121047Z 2aa7784fbe33: Pulling fs layer 2025-12-04T09:22:29.8121394Z dc112c89d57a: Pulling fs layer 2025-12-04T09:22:29.8121706Z 8e023b349080: Pulling fs layer 2025-12-04T09:22:29.8121919Z 816e2e34e018: Pulling fs layer 2025-12-04T09:22:29.8122189Z 465d3fd643aa: Pulling fs layer 2025-12-04T09:22:29.8122808Z a6f4ec14b42b: Pulling fs layer 2025-12-04T09:22:29.8123045Z 15bb11dfc6ac: Pulling fs layer 2025-12-04T09:22:29.8123276Z fe17d9eb0fd2: Pulling fs layer 2025-12-04T09:22:29.8123508Z 2b3b5215d3eb: Pulling fs layer 2025-12-04T09:22:29.8123856Z ef3a5060abce: Pulling fs layer 2025-12-04T09:22:29.8124196Z 24a03847d382: Pulling fs layer 2025-12-04T09:22:29.8124551Z 1969e15d0c13: Pulling fs layer 2025-12-04T09:22:29.8124889Z 99b1f1ea3e85: Pulling fs layer 2025-12-04T09:22:29.8125234Z 3c2c2f8c74bf: Pulling fs layer 2025-12-04T09:22:29.8125576Z 73e33534e9eb: Pulling fs layer 2025-12-04T09:22:29.8125806Z b168858b8537: Pulling fs layer 2025-12-04T09:22:29.8126012Z 9681563a88ff: Pulling fs layer 2025-12-04T09:22:29.8126363Z e7b8cf2e8d5a: Pulling fs layer 2025-12-04T09:22:29.8126573Z 18d6daba0a57: Pulling fs layer 2025-12-04T09:22:29.8126817Z bd87c8766e90: Pulling fs layer 2025-12-04T09:22:29.8127172Z 9681563a88ff: Pulling fs layer 2025-12-04T09:22:29.8127501Z 7e5a0c956cfb: Pulling fs layer 2025-12-04T09:22:29.8127737Z 029495b23122: Pulling fs layer 2025-12-04T09:22:29.8128077Z 20020dd28a15: Pulling fs layer 2025-12-04T09:22:29.8128421Z 4f4fb700ef54: Pulling fs layer 2025-12-04T09:22:29.8128715Z 6c503e779d6f: Pulling fs layer 2025-12-04T09:22:29.8129063Z cb19d84867e4: Pulling fs layer 2025-12-04T09:22:29.8129343Z ff0c473ca120: Pulling fs layer 2025-12-04T09:22:29.8129889Z 8188df80e595: Pulling fs layer 2025-12-04T09:22:29.8130194Z b4f78730cfe7: Pulling fs layer 2025-12-04T09:22:29.8130402Z 2b5a11b41761: Pulling fs layer 2025-12-04T09:22:29.8130717Z c07d27e4d3a5: Pulling fs layer 2025-12-04T09:22:29.8131063Z 5bfdaeb5578d: Pulling fs layer 2025-12-04T09:22:29.8131418Z 3198a9717aac: Pulling fs layer 2025-12-04T09:22:29.8131759Z 8165374f8dcc: Pulling fs layer 2025-12-04T09:22:29.8132103Z ae5280ce969d: Pulling fs layer 2025-12-04T09:22:29.8132408Z f7e9a021f0ee: Pulling fs layer 2025-12-04T09:22:29.8132751Z b21856d1bf42: Pulling fs layer 2025-12-04T09:22:29.8133090Z 835841cca3b7: Pulling fs layer 2025-12-04T09:22:29.8133452Z a534dcf4b9a9: Pulling fs layer 2025-12-04T09:22:29.8133798Z d0fb85b00833: Pulling fs layer 2025-12-04T09:22:29.8134139Z 4f4fb700ef54: Pulling fs layer 2025-12-04T09:22:29.8134485Z 6eb176cefd72: Pulling fs layer 2025-12-04T09:22:29.8134837Z 4e3b10a5dd6a: Pulling fs layer 2025-12-04T09:22:29.8135178Z 2e77500302cc: Pulling fs layer 2025-12-04T09:22:29.8135521Z 9681563a88ff: Pulling fs layer 2025-12-04T09:22:29.8135866Z 3092fab73b59: Pulling fs layer 2025-12-04T09:22:29.8136210Z 6b8d5ff02e26: Pulling fs layer 2025-12-04T09:22:29.8136556Z 1aecc77354ce: Pulling fs layer 2025-12-04T09:22:29.8136904Z a51e0dab2d59: Pulling fs layer 2025-12-04T09:22:29.8137248Z 99a4918e5808: Pulling fs layer 2025-12-04T09:22:29.8137591Z 9681563a88ff: Pulling fs layer 2025-12-04T09:22:29.8137939Z bc08246bb4ba: Pulling fs layer 2025-12-04T09:22:29.8138233Z 5277f2a503eb: Pulling fs layer 2025-12-04T09:22:29.8138445Z 081028f24389: Pulling fs layer 2025-12-04T09:22:29.8968865Z 4f4fb700ef54: Already exists 2025-12-04T09:22:30.0932248Z 3c2c2f8c74bf: Download complete 2025-12-04T09:22:30.0932680Z 8e023b349080: Download complete 2025-12-04T09:22:30.0933871Z 20020dd28a15: Download complete 2025-12-04T09:22:30.0935110Z 029495b23122: Download complete 2025-12-04T09:22:30.0936136Z e7b8cf2e8d5a: Download complete 2025-12-04T09:22:30.0937186Z fe17d9eb0fd2: Download complete 2025-12-04T09:22:30.0938382Z a51e0dab2d59: Download complete 2025-12-04T09:22:30.0939600Z 2b3b5215d3eb: Download complete 2025-12-04T09:22:30.0941073Z ef3a5060abce: Download complete 2025-12-04T09:22:30.0942031Z 9681563a88ff: Download complete 2025-12-04T09:22:30.0943054Z 6c503e779d6f: Download complete 2025-12-04T09:22:30.0944281Z b168858b8537: Download complete 2025-12-04T09:22:30.0945240Z 1969e15d0c13: Download complete 2025-12-04T09:22:30.0946380Z cb19d84867e4: Download complete 2025-12-04T09:22:30.0947886Z b21856d1bf42: Download complete 2025-12-04T09:22:30.0949017Z 465d3fd643aa: Download complete 2025-12-04T09:22:30.0950049Z 835841cca3b7: Download complete 2025-12-04T09:22:30.0951114Z 522eab2402e5: Download complete 2025-12-04T09:22:30.0952276Z 816e2e34e018: Download complete 2025-12-04T09:22:30.0954068Z 2aa7784fbe33: Download complete 2025-12-04T09:22:30.0955348Z bd87c8766e90: Download complete 2025-12-04T09:22:30.0956587Z a6f4ec14b42b: Download complete 2025-12-04T09:22:30.0957920Z ff0c473ca120: Download complete 2025-12-04T09:22:30.0959212Z 59b639308833: Download complete 2025-12-04T09:22:30.0960549Z 081028f24389: Download complete 2025-12-04T09:22:30.1555637Z 835841cca3b7: Pull complete 2025-12-04T09:22:30.1931606Z 8165374f8dcc: Download complete 2025-12-04T09:22:30.1932374Z bc08246bb4ba: Download complete 2025-12-04T09:22:30.1934042Z a534dcf4b9a9: Download complete 2025-12-04T09:22:30.1935191Z 5bfdaeb5578d: Download complete 2025-12-04T09:22:30.1936742Z f7e9a021f0ee: Download complete 2025-12-04T09:22:30.1937907Z ae5280ce969d: Download complete 2025-12-04T09:22:30.1939237Z 99a4918e5808: Download complete 2025-12-04T09:22:30.1940429Z 5277f2a503eb: Download complete 2025-12-04T09:22:30.1941548Z 2b5a11b41761: Download complete 2025-12-04T09:22:30.1942816Z 3092fab73b59: Download complete 2025-12-04T09:22:30.1944166Z 3198a9717aac: Download complete 2025-12-04T09:22:30.1946368Z 6eb176cefd72: Download complete 2025-12-04T09:22:30.1947568Z 4e3b10a5dd6a: Download complete 2025-12-04T09:22:30.1949365Z 18d6daba0a57: Download complete 2025-12-04T09:22:30.1951199Z 7e5a0c956cfb: Download complete 2025-12-04T09:22:30.1952359Z d0fb85b00833: Download complete 2025-12-04T09:22:30.3581000Z c07d27e4d3a5: Download complete 2025-12-04T09:22:30.5240664Z 15bb11dfc6ac: Download complete 2025-12-04T09:22:33.3630727Z 24a03847d382: Download complete 2025-12-04T09:22:33.9611259Z 6b8d5ff02e26: Download complete 2025-12-04T09:22:34.7285001Z 6bbc14b250ef: Download complete 2025-12-04T09:22:41.1329253Z dc112c89d57a: Download complete 2025-12-04T09:22:49.4461689Z 2e77500302cc: Download complete 2025-12-04T09:23:02.2304151Z aac69780afc8: Download complete 2025-12-04T09:23:08.9581892Z 59b639308833: Pull complete 2025-12-04T09:23:09.0088552Z d0fb85b00833: Pull complete 2025-12-04T09:23:09.0315340Z aac69780afc8: Pull complete 2025-12-04T09:23:09.0461008Z 029495b23122: Pull complete 2025-12-04T09:23:10.7594540Z 2b5a11b41761: Pull complete 2025-12-04T09:23:10.7954906Z dc112c89d57a: Pull complete 2025-12-04T09:23:10.8324683Z 9681563a88ff: Pull complete 2025-12-04T09:23:10.8576968Z 522eab2402e5: Pull complete 2025-12-04T09:23:11.9191232Z 1aecc77354ce: Download complete 2025-12-04T09:23:46.1255400Z 99b1f1ea3e85: Download complete 2025-12-04T09:26:02.0335476Z 73e33534e9eb: Download complete 2025-12-04T09:26:49.6409238Z 73e33534e9eb: Pull complete 2025-12-04T09:26:49.6858109Z cb19d84867e4: Pull complete 2025-12-04T09:26:49.6889186Z b21856d1bf42: Pull complete 2025-12-04T09:26:49.7134883Z 8165374f8dcc: Pull complete 2025-12-04T09:26:49.7217459Z 5bfdaeb5578d: Pull complete 2025-12-04T09:26:49.7565706Z c07d27e4d3a5: Pull complete 2025-12-04T09:26:56.5463020Z 1aecc77354ce: Pull complete 2025-12-04T09:26:56.5589634Z 8e023b349080: Pull complete 2025-12-04T09:26:56.5782280Z 6c503e779d6f: Pull complete 2025-12-04T09:26:56.5889757Z 465d3fd643aa: Pull complete 2025-12-04T09:26:56.6164839Z f7e9a021f0ee: Pull complete 2025-12-04T09:30:21.8923212Z b4f78730cfe7: Download complete 2025-12-04T09:30:54.5034540Z 8188df80e595: Download complete 2025-12-04T09:32:06.1482306Z 2aa7784fbe33: Pull complete 2025-12-04T09:32:06.2206814Z 8188df80e595: Pull complete 2025-12-04T09:32:06.2269229Z 3c2c2f8c74bf: Pull complete 2025-12-04T09:32:06.2432307Z 2b3b5215d3eb: Pull complete 2025-12-04T09:32:13.2400418Z 99b1f1ea3e85: Pull complete 2025-12-04T09:32:13.2995967Z 99a4918e5808: Pull complete 2025-12-04T09:32:13.3035768Z 5277f2a503eb: Pull complete 2025-12-04T09:32:13.3101591Z 3198a9717aac: Pull complete 2025-12-04T09:32:13.3203320Z 18d6daba0a57: Pull complete 2025-12-04T09:32:13.3275392Z 15bb11dfc6ac: Pull complete 2025-12-04T09:32:13.3694201Z 1969e15d0c13: Pull complete 2025-12-04T09:32:13.3758273Z bd87c8766e90: Pull complete 2025-12-04T09:32:13.6170949Z b168858b8537: Pull complete 2025-12-04T09:32:13.6207216Z 816e2e34e018: Pull complete 2025-12-04T09:32:13.6565805Z 24a03847d382: Pull complete 2025-12-04T09:32:13.7425781Z 4e3b10a5dd6a: Pull complete 2025-12-04T09:32:13.7487377Z 6b8d5ff02e26: Pull complete 2025-12-04T09:32:13.8001573Z 20020dd28a15: Pull complete 2025-12-04T09:32:13.8035576Z e7b8cf2e8d5a: Pull complete 2025-12-04T09:32:13.8074751Z fe17d9eb0fd2: Pull complete 2025-12-04T09:32:13.8106810Z a51e0dab2d59: Pull complete 2025-12-04T09:32:13.8141461Z ef3a5060abce: Pull complete 2025-12-04T09:32:13.8184517Z a6f4ec14b42b: Pull complete 2025-12-04T09:32:13.8349428Z ae5280ce969d: Pull complete 2025-12-04T09:32:13.8387668Z 3092fab73b59: Pull complete 2025-12-04T09:32:13.8427560Z 6eb176cefd72: Pull complete 2025-12-04T09:32:13.8452089Z 7e5a0c956cfb: Pull complete 2025-12-04T09:33:10.5035122Z 081028f24389: Pull complete 2025-12-04T09:33:10.5101676Z a534dcf4b9a9: Pull complete 2025-12-04T09:33:10.5193370Z b4f78730cfe7: Pull complete 2025-12-04T09:33:16.6970311Z 4f4fb700ef54: Pull complete 2025-12-04T09:33:16.6997069Z ff0c473ca120: Pull complete 2025-12-04T09:33:16.7034136Z bc08246bb4ba: Pull complete 2025-12-04T09:33:16.7115950Z 2e77500302cc: Pull complete 2025-12-04T09:33:17.9745550Z 6bbc14b250ef: Pull complete 2025-12-04T09:33:17.9746285Z Digest: sha256:5e190224966743059cf8506170eaec525eada34e38cf646e02d1dbeadfe5a366 2025-12-04T09:33:17.9747080Z Status: Downloaded newer image for 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:33:17.9749983Z 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:33:17.9808289Z Prepare all required actions 2025-12-04T09:33:17.9828344Z ##[group]Run ./.github/actions/get-workflow-job-id 2025-12-04T09:33:17.9828565Z with: 2025-12-04T09:33:17.9828878Z github-token: *** 2025-12-04T09:33:17.9829043Z env: 2025-12-04T09:33:17.9829190Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:33:17.9829476Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:33:17.9829860Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:33:17.9830209Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:33:17.9830810Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:33:17.9831357Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:33:17.9831542Z AWS_REGION: us-east-1 2025-12-04T09:33:17.9831758Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:33:17.9832018Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:33:17.9835445Z AWS_SESSION_TOKEN: *** 2025-12-04T09:33:17.9835608Z ##[endgroup] 2025-12-04T09:33:17.9845992Z ##[group]Run set -eux 2025-12-04T09:33:17.9846160Z set -eux 2025-12-04T09:33:17.9846487Z python3 .github/scripts/get_workflow_job_id.py "${GITHUB_RUN_ID}" "${RUNNER_NAME}" 2025-12-04T09:33:17.9857904Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:33:17.9858136Z env: 2025-12-04T09:33:17.9858275Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:33:17.9858529Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:33:17.9858898Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:33:17.9859252Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:33:17.9859856Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:33:17.9860410Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:33:17.9860596Z AWS_REGION: us-east-1 2025-12-04T09:33:17.9860804Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:33:17.9861052Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:33:17.9864645Z AWS_SESSION_TOKEN: *** 2025-12-04T09:33:17.9864884Z GITHUB_TOKEN: *** 2025-12-04T09:33:17.9865041Z ##[endgroup] 2025-12-04T09:33:17.9897175Z + python3 .github/scripts/get_workflow_job_id.py 19923123999 ctr-navi3x-b13-ws9 2025-12-04T09:33:18.4534037Z Setting output job-id=57118756853 2025-12-04T09:33:18.4534498Z Setting output job-name=linux-jammy-rocm-py3_10 / test (default, 2, 2, linux.rocm.gpu.gfx1100) 2025-12-04T09:33:18.4719812Z Prepare all required actions 2025-12-04T09:33:18.4720436Z Getting action download info 2025-12-04T09:33:18.7262480Z Download action repository 'seemethere/download-artifact-s3@v4' (SHA:1da556a7aa0a088e3153970611f6c432d58e80e6) 2025-12-04T09:33:19.2878821Z Download action repository 'actions/download-artifact@v4' (SHA:d3f86a106a0bac45b974a628896c90dbdf5c8093) 2025-12-04T09:33:19.8467138Z ##[group]Run ./.github/actions/download-build-artifacts 2025-12-04T09:33:19.8467385Z with: 2025-12-04T09:33:19.8467542Z name: linux-jammy-rocm-py3.10 2025-12-04T09:33:19.8467747Z s3-bucket: gha-artifacts 2025-12-04T09:33:19.8467908Z env: 2025-12-04T09:33:19.8468054Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:33:19.8468306Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:33:19.8468675Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:33:19.8469053Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:33:19.8469659Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:33:19.8470207Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:33:19.8470393Z AWS_REGION: us-east-1 2025-12-04T09:33:19.8470629Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:33:19.8470867Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:33:19.8474292Z AWS_SESSION_TOKEN: *** 2025-12-04T09:33:19.8474454Z ##[endgroup] 2025-12-04T09:33:19.8495180Z ##[group]Run seemethere/download-artifact-s3@v4 2025-12-04T09:33:19.8495395Z with: 2025-12-04T09:33:19.8495547Z name: linux-jammy-rocm-py3.10 2025-12-04T09:33:19.8495739Z s3-bucket: gha-artifacts 2025-12-04T09:33:19.8495907Z region: us-east-1 2025-12-04T09:33:19.8496048Z env: 2025-12-04T09:33:19.8496185Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:33:19.8496444Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:33:19.8496816Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:33:19.8497176Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:33:19.8497780Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:33:19.8498326Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:33:19.8498509Z AWS_REGION: us-east-1 2025-12-04T09:33:19.8498721Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:33:19.8498971Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:33:19.8502373Z AWS_SESSION_TOKEN: *** 2025-12-04T09:33:19.8502528Z ##[endgroup] 2025-12-04T09:33:20.1841437Z (node:1911197) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2025-12-04T09:33:20.1842221Z 2025-12-04T09:33:20.1842535Z Please migrate your code to use AWS SDK for JavaScript (v3). 2025-12-04T09:33:20.1843318Z For more information, check the migration guide at https://a.co/7PzMCcy 2025-12-04T09:33:20.1844120Z (Use `node --trace-warnings ...` to show where the warning was created) 2025-12-04T09:33:20.3995644Z Found 1 objects with prefix pytorch/pytorch/19923123999/linux-jammy-rocm-py3.10/ 2025-12-04T09:33:20.3996768Z Starting download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/artifacts.zip 2025-12-04T09:34:10.8935228Z Finished download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/artifacts.zip 2025-12-04T09:34:10.8949587Z Artifact download has finished successfully 2025-12-04T09:34:10.9381435Z ##[group]Run unzip -o artifacts.zip 2025-12-04T09:34:10.9381714Z unzip -o artifacts.zip 2025-12-04T09:34:10.9402087Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:34:10.9402638Z env: 2025-12-04T09:34:10.9402965Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:10.9403955Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:10.9404837Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:10.9405649Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:10.9407204Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:10.9408490Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:10.9408907Z AWS_REGION: us-east-1 2025-12-04T09:34:10.9409427Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:10.9409976Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:10.9418165Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:10.9418547Z ##[endgroup] 2025-12-04T09:34:10.9492366Z Archive: artifacts.zip 2025-12-04T09:34:10.9493384Z creating: dist/ 2025-12-04T09:34:15.1680123Z inflating: dist/torch-2.10.0a0+gitffd9b0f-cp310-cp310-linux_x86_64.whl 2025-12-04T09:34:15.1783474Z inflating: dist/.ninja_log 2025-12-04T09:34:15.1784048Z creating: build/custom_test_artifacts/ 2025-12-04T09:34:15.1784646Z creating: build/custom_test_artifacts/custom-op-build/ 2025-12-04T09:34:15.1785375Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/ 2025-12-04T09:34:15.1786231Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:34:15.1788196Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:34:15.1789240Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/ 2025-12-04T09:34:15.1790255Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:34:15.1791295Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:34:15.1792269Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:34:15.1793414Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:34:15.1794553Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:34:15.1795605Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:34:15.1796631Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:34:15.1797623Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:34:15.1798462Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:34:15.1798931Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:34:15.1799361Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:34:15.1799836Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:34:15.1800337Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:34:15.1800770Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:34:15.1801124Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:34:15.1801500Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/cmake.check_cache 2025-12-04T09:34:15.1802175Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/ 2025-12-04T09:34:15.1802596Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.ts 2025-12-04T09:34:15.1803068Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.make 2025-12-04T09:34:15.1803657Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/depend.make 2025-12-04T09:34:15.1804083Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/link.txt 2025-12-04T09:34:15.1804515Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/cmake_clean.cmake 2025-12-04T09:34:15.1804956Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/build.make 2025-12-04T09:34:15.1805415Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/DependInfo.cmake 2025-12-04T09:34:15.1805861Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/flags.make 2025-12-04T09:34:15.1806381Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/progress.make 2025-12-04T09:34:15.1818893Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o.d 2025-12-04T09:34:15.1973186Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o 2025-12-04T09:34:15.1974356Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/ 2025-12-04T09:34:15.1975508Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.ts 2025-12-04T09:34:15.1976753Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.make 2025-12-04T09:34:15.1977952Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/depend.make 2025-12-04T09:34:15.1978941Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/link.txt 2025-12-04T09:34:15.1979719Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/cmake_clean.cmake 2025-12-04T09:34:15.1980500Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/build.make 2025-12-04T09:34:15.1981278Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/DependInfo.cmake 2025-12-04T09:34:15.1982039Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/flags.make 2025-12-04T09:34:15.1982799Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/progress.make 2025-12-04T09:34:15.1992015Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o.d 2025-12-04T09:34:15.2053471Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o 2025-12-04T09:34:15.2054842Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:34:15.2055978Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:34:15.2056996Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/progress.marks 2025-12-04T09:34:15.2057965Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile2 2025-12-04T09:34:15.2058830Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile.cmake 2025-12-04T09:34:15.2059472Z inflating: build/custom_test_artifacts/custom-op-build/hipblaslt_test_outer_vec.cc 2025-12-04T09:34:15.2060093Z inflating: build/custom_test_artifacts/custom-op-build/hipblaslt_test_vec_ext.cc 2025-12-04T09:34:15.2060670Z inflating: build/custom_test_artifacts/custom-op-build/CMakeCache.txt 2025-12-04T09:34:15.2061593Z inflating: build/custom_test_artifacts/custom-op-build/Makefile 2025-12-04T09:34:15.2062132Z inflating: build/custom_test_artifacts/custom-op-build/cmake_install.cmake 2025-12-04T09:34:15.2186091Z inflating: build/custom_test_artifacts/custom-op-build/libcustom_ops.so 2025-12-04T09:34:15.2227371Z inflating: build/custom_test_artifacts/custom-op-build/test_custom_ops 2025-12-04T09:34:15.2228676Z creating: build/custom_test_artifacts/jit-hook-build/ 2025-12-04T09:34:15.2229420Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/ 2025-12-04T09:34:15.2230258Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:34:15.2231241Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:34:15.2232184Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/ 2025-12-04T09:34:15.2233109Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:34:15.2234116Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:34:15.2235082Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:34:15.2236265Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:34:15.2237395Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:34:15.2238455Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:34:15.2239466Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:34:15.2240435Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:34:15.2241591Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:34:15.2242769Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:34:15.2243836Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:34:15.2244992Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:34:15.2246379Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:34:15.2247472Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:34:15.2248318Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:34:15.2249189Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/cmake.check_cache 2025-12-04T09:34:15.2250117Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/ 2025-12-04T09:34:15.2251159Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.ts 2025-12-04T09:34:15.2252359Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.make 2025-12-04T09:34:15.2253513Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/depend.make 2025-12-04T09:34:15.2254592Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/link.txt 2025-12-04T09:34:15.2255691Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/cmake_clean.cmake 2025-12-04T09:34:15.2256808Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/build.make 2025-12-04T09:34:15.2257928Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/DependInfo.cmake 2025-12-04T09:34:15.2259029Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/flags.make 2025-12-04T09:34:15.2260124Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/progress.make 2025-12-04T09:34:15.2262248Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o.d 2025-12-04T09:34:15.2310282Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o 2025-12-04T09:34:15.2311961Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:34:15.2313117Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:34:15.2314147Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/progress.marks 2025-12-04T09:34:15.2315074Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile2 2025-12-04T09:34:15.2315981Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile.cmake 2025-12-04T09:34:15.2316904Z inflating: build/custom_test_artifacts/jit-hook-build/hipblaslt_test_outer_vec.cc 2025-12-04T09:34:15.2317829Z inflating: build/custom_test_artifacts/jit-hook-build/hipblaslt_test_vec_ext.cc 2025-12-04T09:34:15.2318676Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeCache.txt 2025-12-04T09:34:15.2319446Z inflating: build/custom_test_artifacts/jit-hook-build/Makefile 2025-12-04T09:34:15.2320239Z inflating: build/custom_test_artifacts/jit-hook-build/cmake_install.cmake 2025-12-04T09:34:15.2343931Z inflating: build/custom_test_artifacts/jit-hook-build/test_jit_hooks 2025-12-04T09:34:15.2344782Z creating: build/custom_test_artifacts/custom-backend-build/ 2025-12-04T09:34:15.2345576Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/ 2025-12-04T09:34:15.2346497Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:34:15.2347586Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:34:15.2348618Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/ 2025-12-04T09:34:15.2349645Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:34:15.2350723Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:34:15.2351778Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:34:15.2352977Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:34:15.2354173Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:34:15.2355329Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:34:15.2356418Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:34:15.2357513Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:34:15.2358975Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:34:15.2360012Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:34:15.2360765Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:34:15.2361515Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:34:15.2362257Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:34:15.2362946Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:34:15.2363502Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:34:15.2364251Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/cmake.check_cache 2025-12-04T09:34:15.2364879Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/ 2025-12-04T09:34:15.2365539Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.ts 2025-12-04T09:34:15.2378822Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.make 2025-12-04T09:34:15.2380074Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/depend.make 2025-12-04T09:34:15.2381319Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/link.txt 2025-12-04T09:34:15.2382561Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/cmake_clean.cmake 2025-12-04T09:34:15.2383814Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/build.make 2025-12-04T09:34:15.2385083Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/DependInfo.cmake 2025-12-04T09:34:15.2386313Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/flags.make 2025-12-04T09:34:15.2387551Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/progress.make 2025-12-04T09:34:15.2388889Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o.d 2025-12-04T09:34:15.2455821Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o 2025-12-04T09:34:15.2457081Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/ 2025-12-04T09:34:15.2458325Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.ts 2025-12-04T09:34:15.2459698Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.make 2025-12-04T09:34:15.2460938Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/depend.make 2025-12-04T09:34:15.2462127Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/link.txt 2025-12-04T09:34:15.2463337Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/cmake_clean.cmake 2025-12-04T09:34:15.2464546Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/build.make 2025-12-04T09:34:15.2465751Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/DependInfo.cmake 2025-12-04T09:34:15.2466954Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/flags.make 2025-12-04T09:34:15.2468148Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/progress.make 2025-12-04T09:34:15.2475539Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o.d 2025-12-04T09:34:15.2516618Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o 2025-12-04T09:34:15.2517910Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:34:15.2519043Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:34:15.2520045Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/progress.marks 2025-12-04T09:34:15.2520979Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile2 2025-12-04T09:34:15.2521880Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile.cmake 2025-12-04T09:34:15.2523102Z inflating: build/custom_test_artifacts/custom-backend-build/hipblaslt_test_outer_vec.cc 2025-12-04T09:34:15.2524037Z inflating: build/custom_test_artifacts/custom-backend-build/hipblaslt_test_vec_ext.cc 2025-12-04T09:34:15.2524903Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeCache.txt 2025-12-04T09:34:15.2525909Z inflating: build/custom_test_artifacts/custom-backend-build/Makefile 2025-12-04T09:34:15.2526830Z inflating: build/custom_test_artifacts/custom-backend-build/cmake_install.cmake 2025-12-04T09:34:15.2597713Z inflating: build/custom_test_artifacts/custom-backend-build/libcustom_backend.so 2025-12-04T09:34:15.2626937Z inflating: build/custom_test_artifacts/custom-backend-build/test_custom_backend 2025-12-04T09:34:15.2627590Z creating: build/lib/ 2025-12-04T09:34:15.2687515Z inflating: build/lib/libprotobuf-lite.a 2025-12-04T09:34:15.3011450Z inflating: build/lib/libprotobuf.a 2025-12-04T09:34:15.3372517Z inflating: build/lib/libprotoc.a 2025-12-04T09:34:15.3379360Z inflating: build/lib/libpthreadpool.a 2025-12-04T09:34:15.3385315Z inflating: build/lib/libcpuinfo.a 2025-12-04T09:34:15.3390940Z inflating: build/lib/libcpuinfo_internals.a 2025-12-04T09:34:15.3391544Z inflating: build/lib/libclog.a 2025-12-04T09:34:15.3405173Z inflating: build/lib/libpytorch_qnnpack.a 2025-12-04T09:34:15.3406089Z inflating: build/lib/libnnpack_reference_layers.a 2025-12-04T09:34:15.3419547Z inflating: build/lib/libnnpack.a 2025-12-04T09:34:15.3554611Z inflating: build/lib/libmicrokernels-prod.a 2025-12-04T09:34:15.4188510Z inflating: build/lib/libmicrokernels-all.a 2025-12-04T09:34:15.4239464Z inflating: build/lib/libgtest.a 2025-12-04T09:34:15.4251925Z inflating: build/lib/libgmock.a 2025-12-04T09:34:15.4252506Z inflating: build/lib/libgmock_main.a 2025-12-04T09:34:15.4253007Z inflating: build/lib/libgtest_main.a 2025-12-04T09:34:15.4318480Z inflating: build/lib/libXNNPACK.a 2025-12-04T09:34:15.4373486Z inflating: build/lib/libbenchmark.a 2025-12-04T09:34:15.4374082Z inflating: build/lib/libbenchmark_main.a 2025-12-04T09:34:15.4421000Z inflating: build/lib/libasmjit.a 2025-12-04T09:34:15.4421609Z inflating: build/lib/libjitprofiling.a 2025-12-04T09:34:15.4427308Z inflating: build/lib/libittnotify.a 2025-12-04T09:34:15.5310624Z inflating: build/lib/libfbgemm.a 2025-12-04T09:34:15.5332619Z inflating: build/lib/libtensorpipe_uv.a 2025-12-04T09:34:15.5734638Z inflating: build/lib/libtensorpipe.a 2025-12-04T09:34:15.5822952Z inflating: build/lib/libgloo.a 2025-12-04T09:34:15.5857696Z inflating: build/lib/libonnx_proto.a 2025-12-04T09:34:15.6182661Z inflating: build/lib/libgloo_hip.a 2025-12-04T09:34:15.6708748Z inflating: build/lib/libonnx.a 2025-12-04T09:34:16.4286923Z inflating: build/lib/libdnnl.a 2025-12-04T09:34:16.4301119Z inflating: build/lib/libfmt.a 2025-12-04T09:34:16.4525449Z inflating: build/lib/libkineto.a 2025-12-04T09:34:16.4609267Z inflating: build/lib/libc10.so 2025-12-04T09:34:16.4609899Z inflating: build/lib/libtorch_global_deps.so 2025-12-04T09:34:16.4642041Z inflating: build/lib/libc10_hip.so 2025-12-04T09:34:16.4642622Z inflating: build/lib/libcaffe2_nvrtc.so 2025-12-04T09:34:16.5013554Z inflating: build/lib/libfbgemm_genai.a 2025-12-04T09:34:18.7507715Z inflating: build/lib/libtorch_cpu.so 2025-12-04T09:34:18.7510754Z inflating: build/lib/libshm.so 2025-12-04T09:34:19.8546237Z inflating: build/lib/libtorch_hip.so 2025-12-04T09:34:19.8546821Z inflating: build/lib/libtorch.so 2025-12-04T09:34:19.8561087Z inflating: build/lib/libjitbackend_test.so 2025-12-04T09:34:19.8614596Z inflating: build/lib/libtorchbind_test.so 2025-12-04T09:34:19.8632093Z inflating: build/lib/libbackend_with_compiler.so 2025-12-04T09:34:19.8651148Z inflating: build/lib/libaoti_custom_ops.so 2025-12-04T09:34:20.0375821Z inflating: build/lib/libtorch_python.so 2025-12-04T09:34:20.0401516Z inflating: build/lib/libnnapi_backend.so 2025-12-04T09:34:20.0402067Z creating: build/bin/ 2025-12-04T09:34:20.0402477Z creating: build/bin/CMakeFiles/ 2025-12-04T09:34:20.0402968Z inflating: build/bin/cmake_install.cmake 2025-12-04T09:34:20.0403489Z inflating: build/bin/CTestTestfile.cmake 2025-12-04T09:34:20.0731881Z inflating: build/bin/protoc-3.13.0.0 2025-12-04T09:34:20.1060836Z inflating: build/bin/protoc 2025-12-04T09:34:20.1103088Z inflating: build/bin/c10_AllocatorConfig_test 2025-12-04T09:34:20.1144251Z inflating: build/bin/c10_DeviceGuard_test 2025-12-04T09:34:20.1185573Z inflating: build/bin/c10_Device_test 2025-12-04T09:34:20.1225641Z inflating: build/bin/c10_CompileTimeFunctionPointer_test 2025-12-04T09:34:20.1272608Z inflating: build/bin/c10_DispatchKeySet_test 2025-12-04T09:34:20.1315700Z inflating: build/bin/c10_Scalar_test 2025-12-04T09:34:20.1355086Z inflating: build/bin/c10_StreamGuard_test 2025-12-04T09:34:20.1400086Z inflating: build/bin/c10_SymInt_test 2025-12-04T09:34:20.1444301Z inflating: build/bin/c10_InlineStreamGuard_test 2025-12-04T09:34:20.1487425Z inflating: build/bin/c10_InlineDeviceGuard_test 2025-12-04T09:34:20.1526984Z inflating: build/bin/c10_ArrayRef_test 2025-12-04T09:34:20.1571481Z inflating: build/bin/c10_SizesAndStrides_test 2025-12-04T09:34:20.1626432Z inflating: build/bin/c10_cow_test 2025-12-04T09:34:20.1665590Z inflating: build/bin/c10_ConstexprCrc_test 2025-12-04T09:34:20.1707666Z inflating: build/bin/c10_Bitset_test 2025-12-04T09:34:20.1747344Z inflating: build/bin/c10_DeadlockDetection_test 2025-12-04T09:34:20.1787731Z inflating: build/bin/c10_Half_test 2025-12-04T09:34:20.1832242Z inflating: build/bin/c10_LeftRight_test 2025-12-04T09:34:20.1877506Z inflating: build/bin/c10_Enumerate_test 2025-12-04T09:34:20.1919480Z inflating: build/bin/c10_IntrusiveList_test 2025-12-04T09:34:20.1961743Z inflating: build/bin/c10_NetworkFlow_test 2025-12-04T09:34:20.2001319Z inflating: build/bin/c10_Semaphore_test 2025-12-04T09:34:20.2041253Z inflating: build/bin/c10_Synchronized_test 2025-12-04T09:34:20.2085276Z inflating: build/bin/c10_ThreadLocal_test 2025-12-04T09:34:20.2126579Z inflating: build/bin/c10_TypeIndex_test 2025-12-04T09:34:20.2167695Z inflating: build/bin/c10_accumulate_test 2025-12-04T09:34:20.2211938Z inflating: build/bin/c10_bfloat16_test 2025-12-04T09:34:20.2252153Z inflating: build/bin/c10_bit_cast_test 2025-12-04T09:34:20.2297213Z inflating: build/bin/c10_complex_math_test 2025-12-04T09:34:20.2336718Z inflating: build/bin/c10_error_test 2025-12-04T09:34:20.2378410Z inflating: build/bin/c10_exception_test 2025-12-04T09:34:20.2422120Z inflating: build/bin/c10_complex_test 2025-12-04T09:34:20.2462238Z inflating: build/bin/c10_flags_test 2025-12-04T09:34:20.2502397Z inflating: build/bin/c10_generic_math_test 2025-12-04T09:34:20.2544793Z inflating: build/bin/c10_lazy_test 2025-12-04T09:34:20.2585492Z inflating: build/bin/c10_irange_test 2025-12-04T09:34:20.2708340Z inflating: build/bin/c10_intrusive_ptr_test 2025-12-04T09:34:20.2753660Z inflating: build/bin/c10_logging_test 2025-12-04T09:34:20.2793442Z inflating: build/bin/c10_nofatal_test 2025-12-04T09:34:20.2835732Z inflating: build/bin/c10_registry_test 2025-12-04T09:34:20.2894654Z inflating: build/bin/c10_optional_test 2025-12-04T09:34:20.2943461Z inflating: build/bin/c10_ordered_preserving_dict_test 2025-12-04T09:34:20.3063273Z inflating: build/bin/c10_small_vector_test 2025-12-04T09:34:20.3104328Z inflating: build/bin/c10_ssize_test 2025-12-04T09:34:20.3149147Z inflating: build/bin/c10_string_util_test 2025-12-04T09:34:20.3189186Z inflating: build/bin/c10_tempfile_test 2025-12-04T09:34:20.3224189Z inflating: build/bin/c10_intrusive_ptr_benchmark 2025-12-04T09:34:20.3268723Z inflating: build/bin/c10_typeid_test 2025-12-04T09:34:20.3307886Z inflating: build/bin/c10_string_view_test 2025-12-04T09:34:20.3347085Z inflating: build/bin/c10_hip_HIPAssertionsTest_1_var_test 2025-12-04T09:34:20.3386132Z inflating: build/bin/c10_hip_HIPAssertionsTest_catches_stream 2025-12-04T09:34:20.3425308Z inflating: build/bin/c10_hip_HIPAssertionsTest_catches_thread_and_block_and_device 2025-12-04T09:34:20.3464311Z inflating: build/bin/c10_hip_HIPAssertionsTest_from_2_processes 2025-12-04T09:34:20.3503876Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_blocks_and_threads 2025-12-04T09:34:20.3542654Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_multiple_blocks 2025-12-04T09:34:20.3581700Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_same_block 2025-12-04T09:34:20.3620871Z inflating: build/bin/c10_hip_HIPTest 2025-12-04T09:34:20.4068662Z inflating: build/bin/vec_test_all_types_DEFAULT 2025-12-04T09:34:20.4528240Z inflating: build/bin/vec_test_all_types_AVX512 2025-12-04T09:34:20.4992468Z inflating: build/bin/vec_test_all_types_AVX2 2025-12-04T09:34:20.5068644Z inflating: build/bin/test_aoti_abi_check 2025-12-04T09:34:20.5108316Z inflating: build/bin/test_vec_half_AVX512 2025-12-04T09:34:20.5147625Z inflating: build/bin/test_vec_half_DEFAULT 2025-12-04T09:34:20.5187276Z inflating: build/bin/test_vec_half_AVX2 2025-12-04T09:34:20.5229000Z inflating: build/bin/BackoffTest 2025-12-04T09:34:20.5271117Z inflating: build/bin/FileStoreTest 2025-12-04T09:34:20.5316010Z inflating: build/bin/TCPStoreTest 2025-12-04T09:34:20.5358514Z inflating: build/bin/HashStoreTest 2025-12-04T09:34:20.5410993Z inflating: build/bin/ProcessGroupGlooTest 2025-12-04T09:34:20.5412373Z inflating: build/bin/example_allreduce 2025-12-04T09:34:20.5415749Z inflating: build/bin/torch_shm_manager 2025-12-04T09:34:20.5458109Z inflating: build/bin/static_runtime_bench 2025-12-04T09:34:20.5650789Z inflating: build/bin/static_runtime_test 2025-12-04T09:34:20.5708357Z inflating: build/bin/Dict_test 2025-12-04T09:34:20.5749938Z inflating: build/bin/Dimname_test 2025-12-04T09:34:20.5801298Z inflating: build/bin/MaybeOwned_test 2025-12-04T09:34:20.5846294Z inflating: build/bin/NamedTensor_test 2025-12-04T09:34:20.5892638Z inflating: build/bin/apply_utils_test 2025-12-04T09:34:20.5939015Z inflating: build/bin/atest 2025-12-04T09:34:20.5989017Z inflating: build/bin/basic 2025-12-04T09:34:20.6031973Z inflating: build/bin/broadcast_test 2025-12-04T09:34:20.6072292Z inflating: build/bin/cpu_allocator_test 2025-12-04T09:34:20.6118066Z inflating: build/bin/cpu_generator_test 2025-12-04T09:34:20.6159783Z inflating: build/bin/cpu_profiling_allocator_test 2025-12-04T09:34:20.6231366Z inflating: build/bin/cpu_rng_test 2025-12-04T09:34:20.6272210Z inflating: build/bin/dlconvertor_test 2025-12-04T09:34:20.6317735Z inflating: build/bin/extension_backend_test 2025-12-04T09:34:20.6361653Z inflating: build/bin/half_test 2025-12-04T09:34:20.6436745Z inflating: build/bin/ivalue_test 2025-12-04T09:34:20.6476311Z inflating: build/bin/lazy_tensor_test 2025-12-04T09:34:20.6518280Z inflating: build/bin/math_kernel_test 2025-12-04T09:34:20.6560263Z inflating: build/bin/memory_format_test 2025-12-04T09:34:20.6602509Z inflating: build/bin/memory_overlapping_test 2025-12-04T09:34:20.6644739Z inflating: build/bin/mobile_memory_cleanup 2025-12-04T09:34:20.6688962Z inflating: build/bin/native_test 2025-12-04T09:34:20.6729576Z inflating: build/bin/operator_name_test 2025-12-04T09:34:20.6770046Z inflating: build/bin/operators_test 2025-12-04T09:34:20.6811517Z inflating: build/bin/packedtensoraccessor_test 2025-12-04T09:34:20.6864505Z inflating: build/bin/pow_test 2025-12-04T09:34:20.6909104Z inflating: build/bin/quantized_test 2025-12-04T09:34:20.6948906Z inflating: build/bin/reduce_ops_test 2025-12-04T09:34:20.6989520Z inflating: build/bin/reportMemoryUsage_test 2025-12-04T09:34:20.7033618Z inflating: build/bin/scalar_tensor_test 2025-12-04T09:34:20.7078826Z inflating: build/bin/scalar_test 2025-12-04T09:34:20.7119703Z inflating: build/bin/StorageUtils_test 2025-12-04T09:34:20.7160820Z inflating: build/bin/stride_properties_test 2025-12-04T09:34:20.7223198Z inflating: build/bin/tensor_iterator_test 2025-12-04T09:34:20.7266216Z inflating: build/bin/test_parallel 2025-12-04T09:34:20.7306760Z inflating: build/bin/thread_init_test 2025-12-04T09:34:20.7349820Z inflating: build/bin/type_ptr_test 2025-12-04T09:34:20.7396546Z inflating: build/bin/type_test 2025-12-04T09:34:20.7438257Z inflating: build/bin/undefined_tensor_test 2025-12-04T09:34:20.7477649Z inflating: build/bin/verify_api_visibility 2025-12-04T09:34:20.7533200Z inflating: build/bin/legacy_vmap_test 2025-12-04T09:34:20.7580160Z inflating: build/bin/IListRef_test 2025-12-04T09:34:20.7620793Z inflating: build/bin/xla_tensor_test 2025-12-04T09:34:20.7661582Z inflating: build/bin/wrapdim_test 2025-12-04T09:34:20.7702338Z inflating: build/bin/weakref_test 2025-12-04T09:34:20.7784859Z inflating: build/bin/List_test 2025-12-04T09:34:20.7836761Z inflating: build/bin/KernelFunction_test 2025-12-04T09:34:20.7911101Z inflating: build/bin/kernel_function_test 2025-12-04T09:34:20.8003861Z inflating: build/bin/kernel_function_legacy_test 2025-12-04T09:34:20.8101581Z inflating: build/bin/kernel_lambda_legacy_test 2025-12-04T09:34:20.8181168Z inflating: build/bin/kernel_lambda_test 2025-12-04T09:34:20.8228425Z inflating: build/bin/kernel_stackbased_test 2025-12-04T09:34:20.8302271Z inflating: build/bin/make_boxed_from_unboxed_functor_test 2025-12-04T09:34:20.8342765Z inflating: build/bin/CppSignature_test 2025-12-04T09:34:20.8381768Z inflating: build/bin/op_allowlist_test 2025-12-04T09:34:20.8434136Z inflating: build/bin/inline_container_test 2025-12-04T09:34:20.8673310Z inflating: build/bin/op_registration_test 2025-12-04T09:34:20.8716828Z inflating: build/bin/backend_fallback_test 2025-12-04T09:34:20.8755926Z inflating: build/bin/hip_complex_math_test 2025-12-04T09:34:20.8797944Z inflating: build/bin/hip_apply_test 2025-12-04T09:34:20.8837146Z inflating: build/bin/hip_complex_test 2025-12-04T09:34:20.8876784Z inflating: build/bin/hip_distributions_test 2025-12-04T09:34:20.8915794Z inflating: build/bin/hip_generator_test 2025-12-04T09:34:20.8954602Z inflating: build/bin/hip_half_test 2025-12-04T09:34:20.8993685Z inflating: build/bin/hip_integer_divider_test 2025-12-04T09:34:20.9032695Z inflating: build/bin/hip_packedtensoraccessor_test 2025-12-04T09:34:20.9071823Z inflating: build/bin/hip_optional_test 2025-12-04T09:34:20.9110853Z inflating: build/bin/hip_vectorized_test 2025-12-04T09:34:20.9151764Z inflating: build/bin/hip_dlconvertor_test 2025-12-04T09:34:20.9967302Z inflating: build/bin/test_jit 2025-12-04T09:34:21.0011145Z inflating: build/bin/test_dist_autograd 2025-12-04T09:34:21.0277904Z inflating: build/bin/test_lazy 2025-12-04T09:34:21.0331587Z inflating: build/bin/test_cpp_rpc 2025-12-04T09:34:21.1208592Z inflating: build/bin/test_api 2025-12-04T09:34:21.1209687Z inflating: build/bin/parallel_benchmark 2025-12-04T09:34:21.1210214Z creating: .additional_ci_files/ 2025-12-04T09:34:21.1257237Z inflating: .additional_ci_files/test-times.json 2025-12-04T09:34:21.1426958Z inflating: .additional_ci_files/test-class-times.json 2025-12-04T09:34:21.1470612Z ##[group]Run rm artifacts.zip 2025-12-04T09:34:21.1471084Z rm artifacts.zip 2025-12-04T09:34:21.1496610Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:34:21.1497178Z env: 2025-12-04T09:34:21.1497517Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:21.1498138Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:21.1499053Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:21.1500240Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:21.1501735Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:21.1503058Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:21.1503489Z AWS_REGION: us-east-1 2025-12-04T09:34:21.1504313Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:21.1504974Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:21.1513645Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:21.1514032Z ##[endgroup] 2025-12-04T09:34:21.3953804Z ##[group]Run df -H 2025-12-04T09:34:21.3954133Z df -H 2025-12-04T09:34:21.3976239Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:34:21.3976712Z env: 2025-12-04T09:34:21.3976996Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:21.3977508Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:21.3978270Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:21.3978946Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:21.3980135Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:21.3981226Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:21.3981591Z AWS_REGION: us-east-1 2025-12-04T09:34:21.3982015Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:21.3982486Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:21.3989273Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:21.3989597Z ##[endgroup] 2025-12-04T09:34:21.4051355Z Filesystem Size Used Avail Use% Mounted on 2025-12-04T09:34:21.4051715Z tmpfs 28G 17M 27G 1% /run 2025-12-04T09:34:21.4052052Z /dev/mapper/ubuntu--vg-ubuntu--lv 3.8T 536G 3.1T 15% / 2025-12-04T09:34:21.4052402Z tmpfs 136G 4.1k 136G 1% /dev/shm 2025-12-04T09:34:21.4052684Z tmpfs 5.3M 0 5.3M 0% /run/lock 2025-12-04T09:34:21.4052980Z /dev/nvme0n1p2 2.1G 294M 1.7G 16% /boot 2025-12-04T09:34:21.4053295Z /dev/nvme0n1p1 1.2G 6.4M 1.2G 1% /boot/efi 2025-12-04T09:34:21.4053600Z tmpfs 28G 50k 28G 1% /run/user/1003 2025-12-04T09:34:21.4104794Z Prepare all required actions 2025-12-04T09:34:21.4105393Z Getting action download info 2025-12-04T09:34:21.5897706Z ##[group]Run ./.github/actions/download-td-artifacts 2025-12-04T09:34:21.5898236Z with: 2025-12-04T09:34:21.5898537Z env: 2025-12-04T09:34:21.5898841Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:21.5899440Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:21.5900316Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:21.5901154Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:21.5902559Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:21.5903838Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:21.5904262Z AWS_REGION: us-east-1 2025-12-04T09:34:21.5904792Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:21.5905352Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:21.5913531Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:21.5913911Z ##[endgroup] 2025-12-04T09:34:21.5958709Z ##[group]Run seemethere/download-artifact-s3@v4 2025-12-04T09:34:21.5959219Z with: 2025-12-04T09:34:21.5959534Z name: td_results 2025-12-04T09:34:21.5959890Z s3-bucket: gha-artifacts 2025-12-04T09:34:21.5960277Z region: us-east-1 2025-12-04T09:34:21.5960600Z env: 2025-12-04T09:34:21.5960911Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:21.5961848Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:21.5962721Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:21.5963539Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:21.5964986Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:21.5966454Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:21.5966881Z AWS_REGION: us-east-1 2025-12-04T09:34:21.5967337Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:21.5967969Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:21.5976160Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:21.5976547Z ##[endgroup] 2025-12-04T09:34:21.9315700Z (node:1911225) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2025-12-04T09:34:21.9316482Z 2025-12-04T09:34:21.9316778Z Please migrate your code to use AWS SDK for JavaScript (v3). 2025-12-04T09:34:21.9317556Z For more information, check the migration guide at https://a.co/7PzMCcy 2025-12-04T09:34:21.9318376Z (Use `node --trace-warnings ...` to show where the warning was created) 2025-12-04T09:34:22.0748116Z Found 1 objects with prefix pytorch/pytorch/19923123999/td_results/ 2025-12-04T09:34:22.0749176Z Starting download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/td_results.json 2025-12-04T09:34:22.2453850Z Finished download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/td_results.json 2025-12-04T09:34:22.2463918Z Artifact download has finished successfully 2025-12-04T09:34:22.2922956Z ##[group]Run mkdir -p .additional_ci_files 2025-12-04T09:34:22.2923514Z mkdir -p .additional_ci_files 2025-12-04T09:34:22.2924164Z mv td_results.json .additional_ci_files/td_results.json || true 2025-12-04T09:34:22.2950334Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:34:22.2950931Z env: 2025-12-04T09:34:22.2951264Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:22.2951876Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:22.2952748Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:22.2953559Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:22.2955347Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:22.2956658Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:22.2957082Z AWS_REGION: us-east-1 2025-12-04T09:34:22.2957568Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:22.2958125Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:22.2966390Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:22.2966771Z ##[endgroup] 2025-12-04T09:34:22.3090376Z ##[group]Run .github/scripts/parse_ref.py 2025-12-04T09:34:22.3090958Z .github/scripts/parse_ref.py 2025-12-04T09:34:22.3117517Z shell: /usr/bin/bash -e {0} 2025-12-04T09:34:22.3117943Z env: 2025-12-04T09:34:22.3118283Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:22.3118908Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:22.3119795Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:22.3120617Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:22.3122042Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:22.3123332Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:22.3123760Z AWS_REGION: us-east-1 2025-12-04T09:34:22.3124284Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:22.3124894Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:22.3133520Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:22.3133901Z ##[endgroup] 2025-12-04T09:34:22.3308700Z Setting output branch=main 2025-12-04T09:34:22.3392945Z Prepare all required actions 2025-12-04T09:34:22.3393218Z Getting action download info 2025-12-04T09:34:22.5097216Z ##[group]Run ./.github/actions/filter-test-configs 2025-12-04T09:34:22.5097437Z with: 2025-12-04T09:34:22.5097711Z github-token: *** 2025-12-04T09:34:22.5098145Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 2, "runner": "linux.rocm.gpu.gfx1100"}, {"config": "default", "shard": 2, "num_shards": 2, "runner": "linux.rocm.gpu.gfx1100"}]} 2025-12-04T09:34:22.5098698Z job-name: linux-jammy-rocm-py3_10 / test (default, 2, 2, linux.rocm.gpu.gfx1100) 2025-12-04T09:34:22.5098969Z env: 2025-12-04T09:34:22.5099105Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:22.5099348Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:22.5099698Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:22.5100080Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:22.5100646Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:22.5101160Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:22.5101334Z AWS_REGION: us-east-1 2025-12-04T09:34:22.5101513Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:22.5101757Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:22.5104999Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:22.5105158Z ##[endgroup] 2025-12-04T09:34:22.5127825Z ##[group]Run nick-fields/retry@v3.0.0 2025-12-04T09:34:22.5128010Z with: 2025-12-04T09:34:22.5128138Z shell: bash 2025-12-04T09:34:22.5128277Z timeout_minutes: 10 2025-12-04T09:34:22.5128425Z max_attempts: 5 2025-12-04T09:34:22.5128571Z retry_wait_seconds: 30 2025-12-04T09:34:22.5129054Z command: set -eux # PyYAML 6.0 doesn't work with MacOS x86 anymore # This must run on Python-3.7 (AmazonLinux2) so can't use request=3.32.2 python3 -m pip install requests==2.27.1 pyyaml==6.0.2 2025-12-04T09:34:22.5129539Z polling_interval_seconds: 1 2025-12-04T09:34:22.5129714Z warning_on_retry: true 2025-12-04T09:34:22.5129873Z continue_on_error: false 2025-12-04T09:34:22.5130029Z env: 2025-12-04T09:34:22.5130160Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:22.5130404Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:22.5130762Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:22.5131098Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:22.5131680Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:22.5132212Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:22.5132390Z AWS_REGION: us-east-1 2025-12-04T09:34:22.5132583Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:22.5132819Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:22.5136080Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:22.5136313Z GITHUB_TOKEN: *** 2025-12-04T09:34:22.5136466Z ##[endgroup] 2025-12-04T09:34:22.5684455Z + python3 -m pip install requests==2.27.1 pyyaml==6.0.2 2025-12-04T09:34:22.7700537Z Defaulting to user installation because normal site-packages is not writeable 2025-12-04T09:34:22.8332482Z Requirement already satisfied: requests==2.27.1 in /home/pytorchci/.local/lib/python3.10/site-packages (2.27.1) 2025-12-04T09:34:22.8336160Z Requirement already satisfied: pyyaml==6.0.2 in /home/pytorchci/.local/lib/python3.10/site-packages (6.0.2) 2025-12-04T09:34:22.8411157Z Requirement already satisfied: certifi>=2017.4.17 in /usr/lib/python3/dist-packages (from requests==2.27.1) (2020.6.20) 2025-12-04T09:34:22.8414878Z Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/lib/python3/dist-packages (from requests==2.27.1) (1.26.5) 2025-12-04T09:34:22.8423538Z Requirement already satisfied: charset-normalizer~=2.0.0 in /home/pytorchci/.local/lib/python3.10/site-packages (from requests==2.27.1) (2.0.12) 2025-12-04T09:34:22.8428940Z Requirement already satisfied: idna<4,>=2.5 in /usr/lib/python3/dist-packages (from requests==2.27.1) (3.3) 2025-12-04T09:34:23.5695051Z Command completed after 1 attempt(s). 2025-12-04T09:34:23.5766569Z ##[group]Run set -x 2025-12-04T09:34:23.5766746Z set -x 2025-12-04T09:34:23.5766883Z  2025-12-04T09:34:23.5767120Z # Use relative path here as this could be checked out anywhere, not necessarily 2025-12-04T09:34:23.5767408Z # in runner workspace 2025-12-04T09:34:23.5767643Z python3 "${GITHUB_ACTION_PATH}/../../scripts/parse_ref.py" 2025-12-04T09:34:23.5778622Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:34:23.5778870Z env: 2025-12-04T09:34:23.5779011Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:23.5779272Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:23.5779659Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:23.5780012Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:23.5780653Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:23.5781181Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:23.5781358Z AWS_REGION: us-east-1 2025-12-04T09:34:23.5781562Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:23.5781842Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:23.5785088Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:23.5785247Z ##[endgroup] 2025-12-04T09:34:23.5815467Z + python3 /home/pytorchci/actions-runner/_work/pytorch/pytorch/./.github/actions/filter-test-configs/../../scripts/parse_ref.py 2025-12-04T09:34:23.5948872Z Setting output branch=main 2025-12-04T09:34:23.6003868Z ##[group]Run echo "Workflow: ${GITHUB_WORKFLOW}" 2025-12-04T09:34:23.6004472Z echo "Workflow: ${GITHUB_WORKFLOW}" 2025-12-04T09:34:23.6004966Z echo "Job name: ${JOB_NAME}" 2025-12-04T09:34:23.6005391Z  2025-12-04T09:34:23.6005950Z # Use relative path here as this could be checked out anywhere, not necessarily 2025-12-04T09:34:23.6006786Z # in runner workspace 2025-12-04T09:34:23.6007409Z python3 "${GITHUB_ACTION_PATH}/../../scripts/filter_test_configs.py" \ 2025-12-04T09:34:23.6008107Z  --workflow "${GITHUB_WORKFLOW}" \ 2025-12-04T09:34:23.6008599Z  --job-name "${JOB_NAME}" \ 2025-12-04T09:34:23.6009796Z  --test-matrix "{"include": [{"config": "default", "shard": 1, "num_shards": 2, "runner": "linux.rocm.gpu.gfx1100"}, {"config": "default", "shard": 2, "num_shards": 2, "runner": "linux.rocm.gpu.gfx1100"}]}" \ 2025-12-04T09:34:23.6011036Z  --selected-test-configs "" \ 2025-12-04T09:34:23.6011542Z  --pr-number "${PR_NUMBER}" \ 2025-12-04T09:34:23.6011995Z  --tag "${TAG}" \ 2025-12-04T09:34:23.6012424Z  --event-name "${EVENT_NAME}" \ 2025-12-04T09:34:23.6012894Z  --schedule "${SCHEDULE}" \ 2025-12-04T09:34:23.6013338Z  --branch "${HEAD_BRANCH}" 2025-12-04T09:34:23.6039631Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:34:23.6040206Z env: 2025-12-04T09:34:23.6040534Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:23.6041140Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:23.6042017Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:23.6042834Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:23.6044585Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:23.6046298Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:23.6046723Z AWS_REGION: us-east-1 2025-12-04T09:34:23.6047212Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:23.6047775Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:23.6055942Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:23.6056543Z GITHUB_TOKEN: *** 2025-12-04T09:34:23.6057149Z JOB_NAME: linux-jammy-rocm-py3_10 / test (default, 2, 2, linux.rocm.gpu.gfx1100) 2025-12-04T09:34:23.6057824Z PR_NUMBER: 2025-12-04T09:34:23.6058148Z TAG: 2025-12-04T09:34:23.6058458Z EVENT_NAME: schedule 2025-12-04T09:34:23.6058837Z SCHEDULE: 45 */2 * * 1-5 2025-12-04T09:34:23.6059213Z HEAD_BRANCH: main 2025-12-04T09:34:23.6059553Z ##[endgroup] 2025-12-04T09:34:23.6112982Z Workflow: rocm-navi31 2025-12-04T09:34:23.6113689Z Job name: linux-jammy-rocm-py3_10 / test (default, 2, 2, linux.rocm.gpu.gfx1100) 2025-12-04T09:34:23.9065873Z Setting output keep-going=True 2025-12-04T09:34:23.9066207Z Setting output ci-verbose-test-logs=False 2025-12-04T09:34:23.9066505Z Setting output ci-test-showlocals=False 2025-12-04T09:34:23.9066755Z Setting output ci-no-test-timeout=False 2025-12-04T09:34:23.9066995Z Setting output ci-no-td=False 2025-12-04T09:34:23.9067224Z Setting output ci-td-distributed=False 2025-12-04T09:34:23.9067459Z Setting output is-unstable=False 2025-12-04T09:34:23.9067685Z Setting output reenabled-issues= 2025-12-04T09:34:23.9068292Z Setting output test-matrix={"include": [{"config": "default", "shard": 1, "num_shards": 2, "runner": "linux.rocm.gpu.gfx1100"}, {"config": "default", "shard": 2, "num_shards": 2, "runner": "linux.rocm.gpu.gfx1100"}]} 2025-12-04T09:34:23.9068915Z Setting output is-test-matrix-empty=False 2025-12-04T09:34:23.9273093Z ##[group]Run echo "Filtered matrix:" 2025-12-04T09:34:23.9273606Z echo "Filtered matrix:" 2025-12-04T09:34:23.9274776Z echo "{"include": [{"config": "default", "shard": 1, "num_shards": 2, "runner": "linux.rocm.gpu.gfx1100"}, {"config": "default", "shard": 2, "num_shards": 2, "runner": "linux.rocm.gpu.gfx1100"}]}" 2025-12-04T09:34:23.9275898Z  2025-12-04T09:34:23.9276199Z echo 2025-12-04T09:34:23.9276614Z echo "Is the current job unstable? False" 2025-12-04T09:34:23.9277106Z  2025-12-04T09:34:23.9277399Z echo 2025-12-04T09:34:23.9277772Z echo "Is keep-going label set? True" 2025-12-04T09:34:23.9278236Z  2025-12-04T09:34:23.9278532Z echo 2025-12-04T09:34:23.9278871Z echo "Reenabled issues? " 2025-12-04T09:34:23.9304649Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:34:23.9305219Z env: 2025-12-04T09:34:23.9305549Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:23.9306160Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:23.9307048Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:23.9307888Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:23.9309320Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:23.9310609Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:23.9311034Z AWS_REGION: us-east-1 2025-12-04T09:34:23.9311513Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:23.9312069Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:23.9320276Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:23.9320663Z ##[endgroup] 2025-12-04T09:34:23.9369721Z Filtered matrix: 2025-12-04T09:34:23.9370816Z {include: [{config: default, shard: 1, num_shards: 2, runner: linux.rocm.gpu.gfx1100}, {config: default, shard: 2, num_shards: 2, runner: linux.rocm.gpu.gfx1100}]} 2025-12-04T09:34:23.9371741Z 2025-12-04T09:34:23.9371908Z Is the current job unstable? False 2025-12-04T09:34:23.9372696Z 2025-12-04T09:34:23.9372890Z Is keep-going label set? True 2025-12-04T09:34:23.9373222Z 2025-12-04T09:34:23.9373641Z Reenabled issues? 2025-12-04T09:34:23.9434058Z ##[group]Run echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2025-12-04T09:34:23.9434858Z echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2025-12-04T09:34:23.9461049Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:34:23.9461626Z env: 2025-12-04T09:34:23.9461960Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:23.9462575Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:23.9463486Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:23.9464331Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:23.9465788Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:23.9467138Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:23.9467617Z AWS_REGION: us-east-1 2025-12-04T09:34:23.9468116Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:23.9468690Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:23.9477007Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:23.9477408Z JOB_TIMEOUT: 300 2025-12-04T09:34:23.9477754Z ##[endgroup] 2025-12-04T09:34:23.9571043Z ##[group]Run env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:34:23.9571842Z env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:34:23.9572530Z env | grep '^CI' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:34:23.9596656Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:34:23.9597225Z env: 2025-12-04T09:34:23.9597560Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:23.9598172Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:23.9599072Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:23.9599885Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:23.9601317Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:23.9602652Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:23.9603085Z AWS_REGION: us-east-1 2025-12-04T09:34:23.9603546Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:23.9604226Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:23.9612535Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:23.9612923Z ##[endgroup] 2025-12-04T09:34:23.9739678Z ##[group]Run set -x 2025-12-04T09:34:23.9740001Z set -x 2025-12-04T09:34:23.9740231Z  2025-12-04T09:34:23.9740491Z if [[ $TEST_CONFIG == 'multigpu' ]]; then 2025-12-04T09:34:23.9740902Z  TEST_COMMAND=.ci/pytorch/multigpu-test.sh 2025-12-04T09:34:23.9741313Z elif [[ $BUILD_ENVIRONMENT == *onnx* ]]; then 2025-12-04T09:34:23.9741686Z  TEST_COMMAND=.ci/caffe2/test.sh 2025-12-04T09:34:23.9741991Z else 2025-12-04T09:34:23.9742247Z  TEST_COMMAND=.ci/pytorch/test.sh 2025-12-04T09:34:23.9742551Z fi 2025-12-04T09:34:23.9742754Z  2025-12-04T09:34:23.9743091Z # detached container should get cleaned up by teardown_ec2_linux 2025-12-04T09:34:23.9743615Z # TODO: Stop building test binaries as part of the build phase 2025-12-04T09:34:23.9744070Z # Used for GPU_FLAG since that doesn't play nice 2025-12-04T09:34:23.9744480Z # shellcheck disable=SC2086,SC2090 2025-12-04T09:34:23.9744821Z container_name=$(docker run \ 2025-12-04T09:34:23.9745137Z  ${GPU_FLAG:-} \ 2025-12-04T09:34:23.9745420Z  -e BUILD_ENVIRONMENT \ 2025-12-04T09:34:23.9745719Z  -e PR_NUMBER \ 2025-12-04T09:34:23.9746210Z  -e GITHUB_ACTIONS \ 2025-12-04T09:34:23.9746497Z  -e GITHUB_REPOSITORY \ 2025-12-04T09:34:23.9746789Z  -e GITHUB_WORKFLOW \ 2025-12-04T09:34:23.9747070Z  -e GITHUB_JOB \ 2025-12-04T09:34:23.9747337Z  -e GITHUB_RUN_ID \ 2025-12-04T09:34:23.9747607Z  -e GITHUB_RUN_NUMBER \ 2025-12-04T09:34:23.9747900Z  -e GITHUB_RUN_ATTEMPT \ 2025-12-04T09:34:23.9748201Z  -e JOB_ID \ 2025-12-04T09:34:23.9748457Z  -e JOB_NAME \ 2025-12-04T09:34:23.9748719Z  -e BASE_SHA \ 2025-12-04T09:34:23.9748975Z  -e BRANCH \ 2025-12-04T09:34:23.9749225Z  -e SHA1 \ 2025-12-04T09:34:23.9749491Z  -e AWS_DEFAULT_REGION \ 2025-12-04T09:34:23.9749784Z  -e IN_WHEEL_TEST \ 2025-12-04T09:34:23.9750055Z  -e SHARD_NUMBER \ 2025-12-04T09:34:23.9750327Z  -e TEST_CONFIG \ 2025-12-04T09:34:23.9750600Z  -e NUM_TEST_SHARDS \ 2025-12-04T09:34:23.9750893Z  -e REENABLED_ISSUES \ 2025-12-04T09:34:23.9751193Z  -e CONTINUE_THROUGH_ERROR \ 2025-12-04T09:34:23.9751511Z  -e VERBOSE_TEST_LOGS \ 2025-12-04T09:34:23.9751810Z  -e TEST_SHOWLOCALS \ 2025-12-04T09:34:23.9752090Z  -e NO_TEST_TIMEOUT \ 2025-12-04T09:34:23.9752375Z  -e NO_TD \ 2025-12-04T09:34:23.9752661Z  -e MAX_JOBS="$(nproc --ignore=2)" \ 2025-12-04T09:34:23.9753017Z  -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK \ 2025-12-04T09:34:23.9753380Z  -e PYTORCH_TEST_RERUN_DISABLED_TESTS \ 2025-12-04T09:34:23.9753741Z  -e TESTS_TO_INCLUDE \ 2025-12-04T09:34:23.9754035Z  -e HUGGING_FACE_HUB_TOKEN \ 2025-12-04T09:34:23.9754352Z  -e DASHBOARD_TAG \ 2025-12-04T09:34:23.9754721Z  --env-file="${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" \ 2025-12-04T09:34:23.9755133Z  --ulimit stack=10485760:83886080 \ 2025-12-04T09:34:23.9755451Z  --ulimit core=0 \ 2025-12-04T09:34:23.9755775Z  --env-file="/tmp/github_env_${GITHUB_RUN_ID}" \ 2025-12-04T09:34:23.9756168Z  --security-opt seccomp=unconfined \ 2025-12-04T09:34:23.9756506Z  --cap-add=SYS_PTRACE \ 2025-12-04T09:34:23.9756799Z  --shm-size="8g" \ 2025-12-04T09:34:23.9757062Z  --tty \ 2025-12-04T09:34:23.9757313Z  --detach \ 2025-12-04T09:34:23.9757579Z  --name="${container_name}" \ 2025-12-04T09:34:23.9757896Z  --user jenkins \ 2025-12-04T09:34:23.9758235Z  -v "${GITHUB_WORKSPACE}:/var/lib/jenkins/workspace" \ 2025-12-04T09:34:23.9758627Z  -w /var/lib/jenkins/workspace \ 2025-12-04T09:34:23.9759147Z  "${DOCKER_IMAGE}" 2025-12-04T09:34:23.9759411Z ) 2025-12-04T09:34:23.9759666Z # save container name for later step 2025-12-04T09:34:23.9760074Z echo "CONTAINER_NAME=${container_name}" >> "$GITHUB_ENV" 2025-12-04T09:34:23.9760798Z # jenkins user does not have write permission to mounted workspace; work-around by copying within container to jenkins home 2025-12-04T09:34:23.9761726Z docker exec -t "${container_name}" sh -c "cd .. && cp -R workspace pytorch && cd pytorch && pip install dist/*.whl && ${TEST_COMMAND}" 2025-12-04T09:34:23.9778139Z shell: /usr/bin/bash -e {0} 2025-12-04T09:34:23.9778416Z env: 2025-12-04T09:34:23.9778638Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:34:23.9779053Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:23.9779646Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:23.9780201Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:23.9781152Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:34:23.9782191Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:34:23.9782481Z AWS_REGION: us-east-1 2025-12-04T09:34:23.9782792Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:34:23.9783161Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:34:23.9788615Z AWS_SESSION_TOKEN: *** 2025-12-04T09:34:23.9788912Z BUILD_ENVIRONMENT: linux-jammy-rocm-py3.10 2025-12-04T09:34:23.9789231Z PR_NUMBER: 2025-12-04T09:34:23.9789475Z GITHUB_REPOSITORY: pytorch/pytorch 2025-12-04T09:34:23.9789783Z GITHUB_WORKFLOW: rocm-navi31 2025-12-04T09:34:23.9790051Z GITHUB_JOB: test 2025-12-04T09:34:23.9790285Z GITHUB_RUN_ID: 19923123999 2025-12-04T09:34:23.9790551Z GITHUB_RUN_NUMBER: 399 2025-12-04T09:34:23.9790805Z GITHUB_RUN_ATTEMPT: 1 2025-12-04T09:34:23.9791053Z JOB_ID: 57118756853 2025-12-04T09:34:23.9791449Z JOB_NAME: linux-jammy-rocm-py3_10 / test (default, 2, 2, linux.rocm.gpu.gfx1100) 2025-12-04T09:34:23.9791892Z BRANCH: main 2025-12-04T09:34:23.9792157Z SHA1: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:34:23.9792549Z BASE_SHA: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:34:23.9792887Z TEST_CONFIG: default 2025-12-04T09:34:23.9793125Z SHARD_NUMBER: 2 2025-12-04T09:34:23.9793351Z NUM_TEST_SHARDS: 2 2025-12-04T09:34:23.9793585Z REENABLED_ISSUES: 2025-12-04T09:34:23.9793830Z CONTINUE_THROUGH_ERROR: True 2025-12-04T09:34:23.9794109Z VERBOSE_TEST_LOGS: False 2025-12-04T09:34:23.9794371Z TEST_SHOWLOCALS: False 2025-12-04T09:34:23.9794630Z NO_TEST_TIMEOUT: False 2025-12-04T09:34:23.9794869Z NO_TD: False 2025-12-04T09:34:23.9795545Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:34:23.9796297Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK: 0 2025-12-04T09:34:23.9796613Z PYTORCH_TEST_RERUN_DISABLED_TESTS: 0 2025-12-04T09:34:23.9798101Z TESTS_TO_INCLUDE: test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor inductor/test_kernel_benchmark inductor/test_pad_mm inductor/test_benchmark_fusion inductor/test_aot_inductor inductor/test_torchinductor inductor/test_decompose_mem_bound_mm inductor/test_flex_attention inductor/test_max_autotune 2025-12-04T09:34:23.9799569Z DASHBOARD_TAG: 2025-12-04T09:34:23.9799908Z HUGGING_FACE_HUB_TOKEN: *** 2025-12-04T09:34:23.9800173Z ##[endgroup] 2025-12-04T09:34:23.9837359Z + [[ default == \m\u\l\t\i\g\p\u ]] 2025-12-04T09:34:23.9837702Z + [[ linux-jammy-rocm-py3.10 == *onnx* ]] 2025-12-04T09:34:23.9838038Z + TEST_COMMAND=.ci/pytorch/test.sh 2025-12-04T09:34:23.9850978Z +++ nproc --ignore=2 2025-12-04T09:34:23.9874482Z ++ docker run --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host -e BUILD_ENVIRONMENT -e PR_NUMBER -e GITHUB_ACTIONS -e GITHUB_REPOSITORY -e GITHUB_WORKFLOW -e GITHUB_JOB -e GITHUB_RUN_ID -e GITHUB_RUN_NUMBER -e GITHUB_RUN_ATTEMPT -e JOB_ID -e JOB_NAME -e BASE_SHA -e BRANCH -e SHA1 -e AWS_DEFAULT_REGION -e IN_WHEEL_TEST -e SHARD_NUMBER -e TEST_CONFIG -e NUM_TEST_SHARDS -e REENABLED_ISSUES -e CONTINUE_THROUGH_ERROR -e VERBOSE_TEST_LOGS -e TEST_SHOWLOCALS -e NO_TEST_TIMEOUT -e NO_TD -e MAX_JOBS=62 -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK -e PYTORCH_TEST_RERUN_DISABLED_TESTS -e TESTS_TO_INCLUDE -e HUGGING_FACE_HUB_TOKEN -e DASHBOARD_TAG --env-file=/home/pytorchci/actions-runner/_work/_temp/github_env_19923123999 --ulimit stack=10485760:83886080 --ulimit core=0 --env-file=/tmp/github_env_19923123999 --security-opt seccomp=unconfined --cap-add=SYS_PTRACE --shm-size=8g --tty --detach --name= --user jenkins -v /home/pytorchci/actions-runner/_work/pytorch/pytorch:/var/lib/jenkins/workspace -w /var/lib/jenkins/workspace 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:34:24.6790491Z + container_name=22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 2025-12-04T09:34:24.6791893Z + echo CONTAINER_NAME=22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 2025-12-04T09:34:24.6793375Z + docker exec -t 22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 sh -c 'cd .. && cp -R workspace pytorch && cd pytorch && pip install dist/*.whl && .ci/pytorch/test.sh' 2025-12-04T09:34:37.1415379Z Processing ./dist/torch-2.10.0a0+gitffd9b0f-cp310-cp310-linux_x86_64.whl 2025-12-04T09:34:37.7783642Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (3.18.0) 2025-12-04T09:34:37.7787068Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (4.12.2) 2025-12-04T09:34:37.7788707Z Requirement already satisfied: sympy>=1.13.3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (1.13.3) 2025-12-04T09:34:37.7792033Z Requirement already satisfied: networkx>=2.5.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (2.8.8) 2025-12-04T09:34:37.7793795Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (3.1.6) 2025-12-04T09:34:37.7798495Z Requirement already satisfied: fsspec>=0.8.5 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (2025.10.0) 2025-12-04T09:34:37.8057656Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy>=1.13.3->torch==2.10.0a0+gitffd9b0f) (1.3.0) 2025-12-04T09:34:37.8096321Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch==2.10.0a0+gitffd9b0f) (3.0.3) 2025-12-04T09:34:38.1000188Z Installing collected packages: torch 2025-12-04T09:34:46.8401521Z Successfully installed torch-2.10.0a0+gitffd9b0f 2025-12-04T09:34:46.8842777Z + export TERM=vt100 2025-12-04T09:34:46.8843259Z + TERM=vt100 2025-12-04T09:34:46.8848225Z ++ dirname .ci/pytorch/test.sh 2025-12-04T09:34:46.8870061Z + source .ci/pytorch/common.sh 2025-12-04T09:34:46.8876913Z +++ dirname .ci/pytorch/common.sh 2025-12-04T09:34:46.8886063Z ++ source .ci/pytorch/common_utils.sh 2025-12-04T09:34:46.8887289Z +++ declare -f -t trap_add 2025-12-04T09:34:46.8890366Z ++ set -ex -o pipefail 2025-12-04T09:34:46.8890805Z ++ [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:34:46.8891296Z ++ unset HIP_PLATFORM 2025-12-04T09:34:46.8891687Z ++ export PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:34:46.8892576Z ++ PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:34:46.8892994Z ++ BUILD_TEST_LIBTORCH=0 2025-12-04T09:34:46.8894041Z ++ dirname .ci/pytorch/test.sh 2025-12-04T09:34:46.8907509Z + source .ci/pytorch/common-build.sh 2025-12-04T09:34:46.8908885Z ++ [[ linux-jammy-rocm-py3.10 != *win-* ]] 2025-12-04T09:34:46.8919063Z ++++ dirname .ci/pytorch/common-build.sh 2025-12-04T09:34:46.8932735Z +++ cd .ci/pytorch 2025-12-04T09:34:46.8933103Z +++ pwd -P 2025-12-04T09:34:46.8935637Z ++ script_dir=/var/lib/jenkins/pytorch/.ci/pytorch 2025-12-04T09:34:46.8935914Z ++ [[ linux-jammy-rocm-py3.10 == *-pch* ]] 2025-12-04T09:34:46.8936142Z ++ which sccache 2025-12-04T09:34:46.8950775Z ++ [[ -z '' ]] 2025-12-04T09:34:46.8951121Z ++ unset SCCACHE_BUCKET 2025-12-04T09:34:46.8951428Z ++ unset SCCACHE_REGION 2025-12-04T09:34:46.8951726Z ++ sccache --stop-server 2025-12-04T09:34:46.8981187Z ++ true 2025-12-04T09:34:46.8981468Z ++ rm -f /var/lib/jenkins/sccache_error.log 2025-12-04T09:34:46.8999414Z ++ trap_add sccache_epilogue EXIT 2025-12-04T09:34:46.8999790Z ++ trap_add_cmd=sccache_epilogue 2025-12-04T09:34:46.9000087Z ++ shift 2025-12-04T09:34:46.9000340Z ++ for trap_add_name in "$@" 2025-12-04T09:34:46.9008979Z ++++ trap -p EXIT 2025-12-04T09:34:46.9011934Z +++ eval 'extract_trap_cmd ' 2025-12-04T09:34:46.9012199Z ++++ extract_trap_cmd 2025-12-04T09:34:46.9012402Z ++++ printf '%s\n' '' 2025-12-04T09:34:46.9012611Z +++ printf '%s\n' sccache_epilogue 2025-12-04T09:34:46.9014079Z ++ trap -- ' 2025-12-04T09:34:46.9014269Z sccache_epilogue' EXIT 2025-12-04T09:34:46.9014468Z ++ [[ -n '' ]] 2025-12-04T09:34:46.9014671Z ++ [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:34:46.9014969Z ++ SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-12-04T09:34:46.9015246Z ++ SCCACHE_IDLE_TIMEOUT=0 2025-12-04T09:34:46.9015455Z ++ sccache --start-server 2025-12-04T09:34:46.9036595Z sccache: Starting the server... 2025-12-04T09:34:46.9151270Z sccache: Listening on address 127.0.0.1:4226 2025-12-04T09:34:46.9159619Z ++ sccache --zero-stats 2025-12-04T09:34:46.9180704Z Statistics zeroed. 2025-12-04T09:34:46.9186748Z ++ which ccache 2025-12-04T09:34:46.9198396Z + [[ linux-jammy-rocm-py3.10 != *rocm* ]] 2025-12-04T09:34:46.9198803Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:34:46.9199173Z + echo 'Environment variables:' 2025-12-04T09:34:46.9199503Z Environment variables: 2025-12-04T09:34:46.9199770Z + env 2025-12-04T09:34:46.9212868Z GITHUB_WORKSPACE=/home/pytorchci/actions-runner/_work/pytorch/pytorch 2025-12-04T09:34:46.9213159Z CONTINUE_THROUGH_ERROR=True 2025-12-04T09:34:46.9213356Z BUILD_ENVIRONMENT=linux-jammy-rocm-py3.10 2025-12-04T09:34:46.9213563Z HOSTNAME=ctr-navi3x-b13-ws9 2025-12-04T09:34:46.9213941Z GITHUB_PATH=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/add_path_f11a9af5-94e3-406c-8260-8c6dab297830 2025-12-04T09:34:46.9214326Z GITHUB_ACTION=__run_2 2025-12-04T09:34:46.9214501Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=0 2025-12-04T09:34:46.9214691Z GITHUB_RUN_NUMBER=399 2025-12-04T09:34:46.9214845Z TEST_CONFIG=default 2025-12-04T09:34:46.9215002Z RUNNER_NAME=ctr-navi3x-b13-ws9 2025-12-04T09:34:46.9215187Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-12-04T09:34:46.9215388Z AWS_DEFAULT_REGION=us-east-1 2025-12-04T09:34:46.9215642Z RUNNER_ARTIFACT_DIR=/home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:46.9215926Z GITHUB_TRIGGERING_ACTOR=pytorchmergebot 2025-12-04T09:34:46.9216120Z GITHUB_REF_TYPE=branch 2025-12-04T09:34:46.9216310Z BASE_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:34:46.9216642Z HUGGING_FACE_HUB_TOKEN=*** 2025-12-04T09:34:46.9217098Z *** 2025-12-04T09:34:46.9217245Z GITHUB_REPOSITORY_ID=65600975 2025-12-04T09:34:46.9217428Z GITHUB_ACTIONS=true 2025-12-04T09:34:46.9217602Z SHA1=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:34:46.9217838Z GITHUB_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:34:46.9218171Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/rocm-navi31.yml@refs/heads/main 2025-12-04T09:34:46.9218468Z UCC_HOME=/usr 2025-12-04T09:34:46.9218618Z RUNNER_ENVIRONMENT=self-hosted 2025-12-04T09:34:46.9218804Z VERBOSE_TEST_LOGS=False 2025-12-04T09:34:46.9218963Z GITHUB_REF=refs/heads/main 2025-12-04T09:34:46.9219301Z RUNNER_OS=Linux 2025-12-04T09:34:46.9219444Z SHARD_NUMBER=2 2025-12-04T09:34:46.9219589Z GITHUB_REF_PROTECTED=true 2025-12-04T09:34:46.9219752Z HOME=/var/lib/jenkins 2025-12-04T09:34:46.9219921Z GITHUB_API_URL=https://api.github.com 2025-12-04T09:34:46.9220129Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2025-12-04T09:34:46.9220374Z RUNNER_DOCS_DIR=/home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:46.9220614Z LANG=C.UTF-8 2025-12-04T09:34:46.9220782Z UCX_COMMIT=29831d319e6be55cb8c768ca61de335c934ca39e 2025-12-04T09:34:46.9220996Z PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:34:46.9221214Z RUNNER_TRACKING_ID=github_b39c5558-40aa-44d8-950e-91afa7e0082f 2025-12-04T09:34:46.9221443Z RUNNER_ARCH=X64 2025-12-04T09:34:46.9221627Z RUNNER_TEMP=/home/pytorchci/actions-runner/_work/_temp 2025-12-04T09:34:46.9221840Z NUM_TEST_SHARDS=2 2025-12-04T09:34:46.9221979Z UCX_HOME=/usr 2025-12-04T09:34:46.9222332Z GITHUB_STATE=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/save_state_f11a9af5-94e3-406c-8260-8c6dab297830 2025-12-04T09:34:46.9222814Z JOB_NAME=linux-jammy-rocm-py3_10 / test (default, 2, 2, linux.rocm.gpu.gfx1100) 2025-12-04T09:34:46.9223094Z MAGMA_HOME=/opt/rocm/magma 2025-12-04T09:34:46.9223448Z GITHUB_ENV=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_env_f11a9af5-94e3-406c-8260-8c6dab297830 2025-12-04T09:34:46.9224054Z GITHUB_EVENT_PATH=/home/pytorchci/actions-runner/_work/_temp/_github_workflow/event.json 2025-12-04T09:34:46.9224357Z GITHUB_EVENT_NAME=schedule 2025-12-04T09:34:46.9224519Z DASHBOARD_TAG= 2025-12-04T09:34:46.9224669Z GITHUB_RUN_ID=19923123999 2025-12-04T09:34:46.9225059Z GITHUB_STEP_SUMMARY=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/step_summary_f11a9af5-94e3-406c-8260-8c6dab297830 2025-12-04T09:34:46.9225474Z GITHUB_ACTOR=pytorchmergebot 2025-12-04T09:34:46.9225641Z PR_NUMBER= 2025-12-04T09:34:46.9225776Z GITHUB_RUN_ATTEMPT=1 2025-12-04T09:34:46.9225936Z ANACONDA_PYTHON_VERSION=3.10 2025-12-04T09:34:46.9226146Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-12-04T09:34:46.9226359Z TERM=vt100 2025-12-04T09:34:46.9226507Z INSTALLED_VISION=yes 2025-12-04T09:34:46.9226662Z BRANCH=main 2025-12-04T09:34:46.9226817Z OPENSSL_ROOT_DIR=/opt/openssl 2025-12-04T09:34:46.9227740Z TESTS_TO_INCLUDE=test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor inductor/test_kernel_benchmark inductor/test_pad_mm inductor/test_benchmark_fusion inductor/test_aot_inductor inductor/test_torchinductor inductor/test_decompose_mem_bound_mm inductor/test_flex_attention inductor/test_max_autotune 2025-12-04T09:34:46.9228897Z GITHUB_ACTION_PATH=/home/pytorchci/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-rocm 2025-12-04T09:34:46.9229262Z GITHUB_SERVER_URL=https://github.com 2025-12-04T09:34:46.9229482Z PYTORCH_ROCM_ARCH=gfx90a;gfx942;gfx950;gfx1100 2025-12-04T09:34:46.9229721Z UCC_COMMIT=9f4b242cbbd8b1462cbc732eb29316cdfa124b77 2025-12-04T09:34:46.9229932Z REENABLED_ISSUES= 2025-12-04T09:34:46.9230068Z SHLVL=1 2025-12-04T09:34:46.9230193Z MAX_JOBS=62 2025-12-04T09:34:46.9230424Z RUNNER_TEST_RESULTS_DIR=/home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:46.9230710Z GITHUB_ACTOR_ID=97764156 2025-12-04T09:34:46.9230928Z RUNNER_TOOL_CACHE=/home/pytorchci/actions-runner/_work/_tool 2025-12-04T09:34:46.9231219Z GITHUB_WORKFLOW_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:34:46.9231451Z GITHUB_REF_NAME=main 2025-12-04T09:34:46.9231603Z ROCM_PATH=/opt/rocm 2025-12-04T09:34:46.9231745Z GITHUB_JOB=test 2025-12-04T09:34:46.9231886Z NO_TEST_TIMEOUT=False 2025-12-04T09:34:46.9232050Z GITHUB_REPOSITORY=pytorch/pytorch 2025-12-04T09:34:46.9232225Z LC_ALL=C.UTF-8 2025-12-04T09:34:46.9232368Z GITHUB_RETENTION_DAYS=90 2025-12-04T09:34:46.9232583Z RUNNER_WORKSPACE=/home/pytorchci/actions-runner/_work/pytorch 2025-12-04T09:34:46.9232823Z OPENSSL_DIR=/opt/openssl 2025-12-04T09:34:46.9232992Z GITHUB_ACTION_REPOSITORY= 2025-12-04T09:34:46.9233679Z PATH=/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:34:46.9234287Z GITHUB_BASE_REF= 2025-12-04T09:34:46.9234430Z CI=true 2025-12-04T09:34:46.9234572Z GITHUB_REPOSITORY_OWNER=pytorch 2025-12-04T09:34:46.9234747Z JOB_ID=57118756853 2025-12-04T09:34:46.9234891Z GITHUB_HEAD_REF= 2025-12-04T09:34:46.9235033Z GITHUB_ACTION_REF= 2025-12-04T09:34:46.9235179Z TEST_SHOWLOCALS=False 2025-12-04T09:34:46.9235344Z GITHUB_WORKFLOW=rocm-navi31 2025-12-04T09:34:46.9235521Z DEBIAN_FRONTEND=noninteractive 2025-12-04T09:34:46.9235910Z GITHUB_OUTPUT=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_output_f11a9af5-94e3-406c-8260-8c6dab297830 2025-12-04T09:34:46.9236301Z NO_TD=False 2025-12-04T09:34:46.9236439Z OLDPWD=/var/lib/jenkins 2025-12-04T09:34:46.9236590Z _=/usr/bin/env 2025-12-04T09:34:46.9236795Z ++ python -c 'import site; print(site.getsitepackages()[0])' 2025-12-04T09:34:46.9315150Z + TORCH_INSTALL_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch 2025-12-04T09:34:46.9315835Z + TORCH_BIN_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2025-12-04T09:34:46.9316690Z + TORCH_LIB_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib 2025-12-04T09:34:46.9317328Z + TORCH_TEST_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/test 2025-12-04T09:34:46.9317798Z + BUILD_DIR=build 2025-12-04T09:34:46.9318071Z + BUILD_RENAMED_DIR=build_renamed 2025-12-04T09:34:46.9318388Z + BUILD_BIN_DIR=build/bin 2025-12-04T09:34:46.9318662Z + SHARD_NUMBER=2 2025-12-04T09:34:46.9318907Z + NUM_TEST_SHARDS=2 2025-12-04T09:34:46.9319194Z + export TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:34:46.9319428Z + TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:34:46.9319648Z + export VALGRIND=ON 2025-12-04T09:34:46.9319828Z + VALGRIND=ON 2025-12-04T09:34:46.9320020Z + [[ linux-jammy-rocm-py3.10 == *clang9* ]] 2025-12-04T09:34:46.9320285Z + [[ linux-jammy-rocm-py3.10 == *xpu* ]] 2025-12-04T09:34:46.9320516Z + detect_cuda_arch 2025-12-04T09:34:46.9320712Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:34:46.9320960Z + [[ linux-jammy-rocm-py3.10 == *s390x* ]] 2025-12-04T09:34:46.9321189Z + [[ 0 == \1 ]] 2025-12-04T09:34:46.9321355Z + [[ True == \1 ]] 2025-12-04T09:34:46.9321545Z + [[ linux-jammy-rocm-py3.10 != *bazel* ]] 2025-12-04T09:34:46.9321795Z ++ realpath build/custom_test_artifacts 2025-12-04T09:34:46.9333617Z + CUSTOM_TEST_ARTIFACT_BUILD_DIR=/var/lib/jenkins/pytorch/build/custom_test_artifacts 2025-12-04T09:34:46.9334640Z + [[ -n test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor inductor/test_kernel_benchmark inductor/test_pad_mm inductor/test_benchmark_fusion inductor/test_aot_inductor inductor/test_torchinductor inductor/test_decompose_mem_bound_mm inductor/test_flex_attention inductor/test_max_autotune ]] 2025-12-04T09:34:46.9335556Z + echo 'Setting INCLUDE_CLAUSE' 2025-12-04T09:34:46.9335732Z Setting INCLUDE_CLAUSE 2025-12-04T09:34:46.9336665Z + INCLUDE_CLAUSE='--include test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor inductor/test_kernel_benchmark inductor/test_pad_mm inductor/test_benchmark_fusion inductor/test_aot_inductor inductor/test_torchinductor inductor/test_decompose_mem_bound_mm inductor/test_flex_attention inductor/test_max_autotune' 2025-12-04T09:34:46.9337634Z + echo 'Environment variables' 2025-12-04T09:34:46.9337816Z Environment variables 2025-12-04T09:34:46.9337962Z + env 2025-12-04T09:34:46.9342828Z GITHUB_WORKSPACE=/home/pytorchci/actions-runner/_work/pytorch/pytorch 2025-12-04T09:34:46.9343111Z CONTINUE_THROUGH_ERROR=True 2025-12-04T09:34:46.9343309Z BUILD_ENVIRONMENT=linux-jammy-rocm-py3.10 2025-12-04T09:34:46.9343521Z HOSTNAME=ctr-navi3x-b13-ws9 2025-12-04T09:34:46.9344021Z GITHUB_PATH=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/add_path_f11a9af5-94e3-406c-8260-8c6dab297830 2025-12-04T09:34:46.9344407Z GITHUB_ACTION=__run_2 2025-12-04T09:34:46.9344574Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=0 2025-12-04T09:34:46.9344762Z GITHUB_RUN_NUMBER=399 2025-12-04T09:34:46.9344912Z TEST_CONFIG=default 2025-12-04T09:34:46.9345073Z RUNNER_NAME=ctr-navi3x-b13-ws9 2025-12-04T09:34:46.9345259Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-12-04T09:34:46.9345451Z AWS_DEFAULT_REGION=us-east-1 2025-12-04T09:34:46.9345701Z RUNNER_ARTIFACT_DIR=/home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T09:34:46.9345985Z GITHUB_TRIGGERING_ACTOR=pytorchmergebot 2025-12-04T09:34:46.9346181Z GITHUB_REF_TYPE=branch 2025-12-04T09:34:46.9346368Z BASE_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:34:46.9346655Z HUGGING_FACE_HUB_TOKEN=*** 2025-12-04T09:34:46.9346844Z *** 2025-12-04T09:34:46.9346985Z GITHUB_REPOSITORY_ID=65600975 2025-12-04T09:34:46.9347159Z GITHUB_ACTIONS=true 2025-12-04T09:34:46.9347336Z SHA1=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:34:46.9347576Z GITHUB_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:34:46.9347929Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/rocm-navi31.yml@refs/heads/main 2025-12-04T09:34:46.9348227Z UCC_HOME=/usr 2025-12-04T09:34:46.9348373Z TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:34:46.9348700Z RUNNER_ENVIRONMENT=self-hosted 2025-12-04T09:34:46.9348874Z VERBOSE_TEST_LOGS=False 2025-12-04T09:34:46.9349045Z GITHUB_REF=refs/heads/main 2025-12-04T09:34:46.9349301Z RUNNER_OS=Linux 2025-12-04T09:34:46.9349536Z SHARD_NUMBER=2 2025-12-04T09:34:46.9349846Z GITHUB_REF_PROTECTED=true 2025-12-04T09:34:46.9350117Z HOME=/var/lib/jenkins 2025-12-04T09:34:46.9350376Z GITHUB_API_URL=https://api.github.com 2025-12-04T09:34:46.9350772Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2025-12-04T09:34:46.9360664Z RUNNER_DOCS_DIR=/home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T09:34:46.9360937Z LANG=C.UTF-8 2025-12-04T09:34:46.9361127Z UCX_COMMIT=29831d319e6be55cb8c768ca61de335c934ca39e 2025-12-04T09:34:46.9361362Z PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:34:46.9361592Z RUNNER_TRACKING_ID=github_b39c5558-40aa-44d8-950e-91afa7e0082f 2025-12-04T09:34:46.9361834Z RUNNER_ARCH=X64 2025-12-04T09:34:46.9362034Z RUNNER_TEMP=/home/pytorchci/actions-runner/_work/_temp 2025-12-04T09:34:46.9362263Z NUM_TEST_SHARDS=2 2025-12-04T09:34:46.9362416Z UCX_HOME=/usr 2025-12-04T09:34:46.9362778Z GITHUB_STATE=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/save_state_f11a9af5-94e3-406c-8260-8c6dab297830 2025-12-04T09:34:46.9363269Z JOB_NAME=linux-jammy-rocm-py3_10 / test (default, 2, 2, linux.rocm.gpu.gfx1100) 2025-12-04T09:34:46.9363563Z MAGMA_HOME=/opt/rocm/magma 2025-12-04T09:34:46.9363928Z GITHUB_ENV=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_env_f11a9af5-94e3-406c-8260-8c6dab297830 2025-12-04T09:34:46.9364424Z GITHUB_EVENT_PATH=/home/pytorchci/actions-runner/_work/_temp/_github_workflow/event.json 2025-12-04T09:34:46.9364738Z GITHUB_EVENT_NAME=schedule 2025-12-04T09:34:46.9364903Z DASHBOARD_TAG= 2025-12-04T09:34:46.9365055Z GITHUB_RUN_ID=19923123999 2025-12-04T09:34:46.9365448Z GITHUB_STEP_SUMMARY=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/step_summary_f11a9af5-94e3-406c-8260-8c6dab297830 2025-12-04T09:34:46.9365869Z GITHUB_ACTOR=pytorchmergebot 2025-12-04T09:34:46.9366044Z PR_NUMBER= 2025-12-04T09:34:46.9366265Z GITHUB_RUN_ATTEMPT=1 2025-12-04T09:34:46.9366421Z VALGRIND=ON 2025-12-04T09:34:46.9366565Z ANACONDA_PYTHON_VERSION=3.10 2025-12-04T09:34:46.9366781Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-12-04T09:34:46.9366999Z TERM=vt100 2025-12-04T09:34:46.9367138Z INSTALLED_VISION=yes 2025-12-04T09:34:46.9367293Z BRANCH=main 2025-12-04T09:34:46.9367439Z OPENSSL_ROOT_DIR=/opt/openssl 2025-12-04T09:34:46.9368507Z TESTS_TO_INCLUDE=test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor inductor/test_kernel_benchmark inductor/test_pad_mm inductor/test_benchmark_fusion inductor/test_aot_inductor inductor/test_torchinductor inductor/test_decompose_mem_bound_mm inductor/test_flex_attention inductor/test_max_autotune 2025-12-04T09:34:46.9369606Z GITHUB_ACTION_PATH=/home/pytorchci/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-rocm 2025-12-04T09:34:46.9369972Z GITHUB_SERVER_URL=https://github.com 2025-12-04T09:34:46.9370193Z PYTORCH_ROCM_ARCH=gfx90a;gfx942;gfx950;gfx1100 2025-12-04T09:34:46.9370432Z UCC_COMMIT=9f4b242cbbd8b1462cbc732eb29316cdfa124b77 2025-12-04T09:34:46.9370644Z REENABLED_ISSUES= 2025-12-04T09:34:46.9370789Z SHLVL=1 2025-12-04T09:34:46.9370916Z MAX_JOBS=62 2025-12-04T09:34:46.9371151Z RUNNER_TEST_RESULTS_DIR=/home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T09:34:46.9371442Z GITHUB_ACTOR_ID=97764156 2025-12-04T09:34:46.9371659Z RUNNER_TOOL_CACHE=/home/pytorchci/actions-runner/_work/_tool 2025-12-04T09:34:46.9371947Z GITHUB_WORKFLOW_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:34:46.9372180Z GITHUB_REF_NAME=main 2025-12-04T09:34:46.9372335Z ROCM_PATH=/opt/rocm 2025-12-04T09:34:46.9372486Z GITHUB_JOB=test 2025-12-04T09:34:46.9372639Z NO_TEST_TIMEOUT=False 2025-12-04T09:34:46.9372815Z GITHUB_REPOSITORY=pytorch/pytorch 2025-12-04T09:34:46.9372999Z LC_ALL=C.UTF-8 2025-12-04T09:34:46.9373152Z GITHUB_RETENTION_DAYS=90 2025-12-04T09:34:46.9373485Z RUNNER_WORKSPACE=/home/pytorchci/actions-runner/_work/pytorch 2025-12-04T09:34:46.9373730Z OPENSSL_DIR=/opt/openssl 2025-12-04T09:34:46.9373900Z GITHUB_ACTION_REPOSITORY= 2025-12-04T09:34:46.9374482Z PATH=/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:34:46.9375075Z GITHUB_BASE_REF= 2025-12-04T09:34:46.9375217Z CI=true 2025-12-04T09:34:46.9375356Z GITHUB_REPOSITORY_OWNER=pytorch 2025-12-04T09:34:46.9375528Z JOB_ID=57118756853 2025-12-04T09:34:46.9375676Z GITHUB_HEAD_REF= 2025-12-04T09:34:46.9375822Z GITHUB_ACTION_REF= 2025-12-04T09:34:46.9375974Z TEST_SHOWLOCALS=False 2025-12-04T09:34:46.9376141Z GITHUB_WORKFLOW=rocm-navi31 2025-12-04T09:34:46.9376321Z DEBIAN_FRONTEND=noninteractive 2025-12-04T09:34:46.9376710Z GITHUB_OUTPUT=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_output_f11a9af5-94e3-406c-8260-8c6dab297830 2025-12-04T09:34:46.9377107Z NO_TD=False 2025-12-04T09:34:46.9377245Z OLDPWD=/var/lib/jenkins 2025-12-04T09:34:46.9377399Z _=/usr/bin/env 2025-12-04T09:34:46.9377558Z + echo 'Testing pytorch' 2025-12-04T09:34:46.9377716Z Testing pytorch 2025-12-04T09:34:46.9377862Z + export LANG=C.UTF-8 2025-12-04T09:34:46.9378011Z + LANG=C.UTF-8 2025-12-04T09:34:46.9378145Z + PR_NUMBER= 2025-12-04T09:34:46.9378287Z + [[ default == \d\e\f\a\u\l\t ]] 2025-12-04T09:34:46.9378472Z + export CUDA_VISIBLE_DEVICES=0 2025-12-04T09:34:46.9378647Z + CUDA_VISIBLE_DEVICES=0 2025-12-04T09:34:46.9378811Z + export HIP_VISIBLE_DEVICES=0 2025-12-04T09:34:46.9378988Z + HIP_VISIBLE_DEVICES=0 2025-12-04T09:34:46.9379157Z + [[ default == \d\i\s\t\r\i\b\u\t\e\d ]] 2025-12-04T09:34:46.9379350Z + [[ default == \s\l\o\w ]] 2025-12-04T09:34:46.9379543Z + [[ linux-jammy-rocm-py3.10 == *slow-gradcheck* ]] 2025-12-04T09:34:46.9379767Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:34:46.9379965Z + [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:34:46.9380180Z + export PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2025-12-04T09:34:46.9380395Z + PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2025-12-04T09:34:46.9380588Z + [[ default == *crossref* ]] 2025-12-04T09:34:46.9380766Z + [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:34:46.9380955Z + export VALGRIND=OFF 2025-12-04T09:34:46.9381104Z + VALGRIND=OFF 2025-12-04T09:34:46.9381237Z + rocminfo 2025-12-04T09:34:46.9471328Z ROCk module version 6.14.14 is loaded 2025-12-04T09:34:47.6965358Z ===================== 2025-12-04T09:34:47.6965976Z HSA System Attributes 2025-12-04T09:34:47.6966528Z ===================== 2025-12-04T09:34:47.6966909Z Runtime Version: 1.18 2025-12-04T09:34:47.6967787Z Runtime Ext Version: 1.14 2025-12-04T09:34:47.6968226Z System Timestamp Freq.: 1000.000000MHz 2025-12-04T09:34:47.6968940Z Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) 2025-12-04T09:34:47.6969718Z Machine Model: LARGE 2025-12-04T09:34:47.6970361Z System Endianness: LITTLE 2025-12-04T09:34:47.6970891Z Mwaitx: DISABLED 2025-12-04T09:34:47.6971318Z XNACK enabled: NO 2025-12-04T09:34:47.6971716Z DMAbuf Support: YES 2025-12-04T09:34:47.6972102Z VMM Support: YES 2025-12-04T09:34:47.6972344Z 2025-12-04T09:34:47.6972480Z ========== 2025-12-04T09:34:47.6972850Z HSA Agents 2025-12-04T09:34:47.6973206Z ========== 2025-12-04T09:34:47.6973556Z ******* 2025-12-04T09:34:47.6973813Z Agent 1 2025-12-04T09:34:47.6973995Z ******* 2025-12-04T09:34:47.6974246Z Name: AMD Ryzen Threadripper PRO 5975WX 32-Cores 2025-12-04T09:34:47.6974561Z Uuid: CPU-XX 2025-12-04T09:34:47.6974885Z Marketing Name: AMD Ryzen Threadripper PRO 5975WX 32-Cores 2025-12-04T09:34:47.6975368Z Vendor Name: CPU 2025-12-04T09:34:47.6975674Z Feature: None specified 2025-12-04T09:34:47.6975971Z Profile: FULL_PROFILE 2025-12-04T09:34:47.6976269Z Float Round Mode: NEAR 2025-12-04T09:34:47.6976573Z Max Queue Number: 0(0x0) 2025-12-04T09:34:47.6976866Z Queue Min Size: 0(0x0) 2025-12-04T09:34:47.6977158Z Queue Max Size: 0(0x0) 2025-12-04T09:34:47.6977443Z Queue Type: MULTI 2025-12-04T09:34:47.6977721Z Node: 0 2025-12-04T09:34:47.6978000Z Device Type: CPU 2025-12-04T09:34:47.6978262Z Cache Info: 2025-12-04T09:34:47.6978480Z L1: 32768(0x8000) KB 2025-12-04T09:34:47.6978752Z Chip ID: 0(0x0) 2025-12-04T09:34:47.6979039Z ASIC Revision: 0(0x0) 2025-12-04T09:34:47.6979337Z Cacheline Size: 64(0x40) 2025-12-04T09:34:47.6979638Z Max Clock Freq. (MHz): 3600 2025-12-04T09:34:47.6979915Z BDFID: 0 2025-12-04T09:34:47.6980197Z Internal Node ID: 0 2025-12-04T09:34:47.6980492Z Compute Unit: 64 2025-12-04T09:34:47.6980780Z SIMDs per CU: 0 2025-12-04T09:34:47.6981074Z Shader Engines: 0 2025-12-04T09:34:47.6981376Z Shader Arrs. per Eng.: 0 2025-12-04T09:34:47.6981727Z WatchPts on Addr. Ranges:1 2025-12-04T09:34:47.6982007Z Memory Properties: 2025-12-04T09:34:47.6982214Z Features: None 2025-12-04T09:34:47.6982424Z Pool Info: 2025-12-04T09:34:47.6982624Z Pool 1 2025-12-04T09:34:47.6982878Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T09:34:47.6983183Z Size: 263739968(0xfb85a40) KB 2025-12-04T09:34:47.6983685Z Allocatable: TRUE 2025-12-04T09:34:47.6984229Z Alloc Granule: 4KB 2025-12-04T09:34:47.6984807Z Alloc Recommended Granule:4KB 2025-12-04T09:34:47.6985589Z Alloc Alignment: 4KB 2025-12-04T09:34:47.6986160Z Accessible by all: TRUE 2025-12-04T09:34:47.6986634Z Pool 2 2025-12-04T09:34:47.6987071Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T09:34:47.6987591Z Size: 263739968(0xfb85a40) KB 2025-12-04T09:34:47.6988107Z Allocatable: TRUE 2025-12-04T09:34:47.6988645Z Alloc Granule: 4KB 2025-12-04T09:34:47.6989202Z Alloc Recommended Granule:4KB 2025-12-04T09:34:47.6989765Z Alloc Alignment: 4KB 2025-12-04T09:34:47.6990317Z Accessible by all: TRUE 2025-12-04T09:34:47.6990782Z Pool 3 2025-12-04T09:34:47.6991217Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2025-12-04T09:34:47.6991721Z Size: 263739968(0xfb85a40) KB 2025-12-04T09:34:47.6992227Z Allocatable: TRUE 2025-12-04T09:34:47.6992752Z Alloc Granule: 4KB 2025-12-04T09:34:47.6993507Z Alloc Recommended Granule:4KB 2025-12-04T09:34:47.6994071Z Alloc Alignment: 4KB 2025-12-04T09:34:47.6994616Z Accessible by all: TRUE 2025-12-04T09:34:47.6995085Z Pool 4 2025-12-04T09:34:47.6995510Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T09:34:47.6996016Z Size: 263739968(0xfb85a40) KB 2025-12-04T09:34:47.6996522Z Allocatable: TRUE 2025-12-04T09:34:47.6997065Z Alloc Granule: 4KB 2025-12-04T09:34:47.6997622Z Alloc Recommended Granule:4KB 2025-12-04T09:34:47.6998182Z Alloc Alignment: 4KB 2025-12-04T09:34:47.6998725Z Accessible by all: TRUE 2025-12-04T09:34:47.6999204Z ISA Info: 2025-12-04T09:34:47.6999545Z ******* 2025-12-04T09:34:47.6999878Z Agent 2 2025-12-04T09:34:47.7000199Z ******* 2025-12-04T09:34:47.7000557Z Name: gfx1100 2025-12-04T09:34:47.7001052Z Uuid: GPU-0ac6979acc6ebee8 2025-12-04T09:34:47.7001569Z Marketing Name: 2025-12-04T09:34:47.7002094Z Vendor Name: AMD 2025-12-04T09:34:47.7002614Z Feature: KERNEL_DISPATCH 2025-12-04T09:34:47.7003139Z Profile: BASE_PROFILE 2025-12-04T09:34:47.7003670Z Float Round Mode: NEAR 2025-12-04T09:34:47.7004208Z Max Queue Number: 128(0x80) 2025-12-04T09:34:47.7004741Z Queue Min Size: 64(0x40) 2025-12-04T09:34:47.7005263Z Queue Max Size: 131072(0x20000) 2025-12-04T09:34:47.7005779Z Queue Type: MULTI 2025-12-04T09:34:47.7006360Z Node: 1 2025-12-04T09:34:47.7006852Z Device Type: GPU 2025-12-04T09:34:47.7007308Z Cache Info: 2025-12-04T09:34:47.7007686Z L1: 32(0x20) KB 2025-12-04T09:34:47.7008131Z L2: 6144(0x1800) KB 2025-12-04T09:34:47.7008808Z L3: 81920(0x14000) KB 2025-12-04T09:34:47.7009277Z Chip ID: 29772(0x744c) 2025-12-04T09:34:47.7009781Z ASIC Revision: 0(0x0) 2025-12-04T09:34:47.7010310Z Cacheline Size: 128(0x80) 2025-12-04T09:34:47.7010850Z Max Clock Freq. (MHz): 2025 2025-12-04T09:34:47.7011347Z BDFID: 17152 2025-12-04T09:34:47.7011847Z Internal Node ID: 1 2025-12-04T09:34:47.7012368Z Compute Unit: 84 2025-12-04T09:34:47.7012874Z SIMDs per CU: 2 2025-12-04T09:34:47.7013391Z Shader Engines: 6 2025-12-04T09:34:47.7013922Z Shader Arrs. per Eng.: 2 2025-12-04T09:34:47.7014478Z WatchPts on Addr. Ranges:4 2025-12-04T09:34:47.7015042Z Coherent Host Access: FALSE 2025-12-04T09:34:47.7015528Z Memory Properties: 2025-12-04T09:34:47.7015919Z Features: KERNEL_DISPATCH 2025-12-04T09:34:47.7016665Z Fast F16 Operation: TRUE 2025-12-04T09:34:47.7017199Z Wavefront Size: 32(0x20) 2025-12-04T09:34:47.7017737Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:34:47.7018234Z Workgroup Max Size per Dimension: 2025-12-04T09:34:47.7018649Z x 1024(0x400) 2025-12-04T09:34:47.7019087Z y 1024(0x400) 2025-12-04T09:34:47.7019513Z z 1024(0x400) 2025-12-04T09:34:47.7019990Z Max Waves Per CU: 32(0x20) 2025-12-04T09:34:47.7020530Z Max Work-item Per CU: 1024(0x400) 2025-12-04T09:34:47.7021065Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:34:47.7021540Z Grid Max Size per Dimension: 2025-12-04T09:34:47.7021926Z x 2147483647(0x7fffffff) 2025-12-04T09:34:47.7022370Z y 65535(0xffff) 2025-12-04T09:34:47.7022794Z z 65535(0xffff) 2025-12-04T09:34:47.7023299Z Max fbarriers/Workgrp: 32 2025-12-04T09:34:47.7023899Z Packet Processor uCode:: 552 2025-12-04T09:34:47.7024459Z SDMA engine uCode:: 24 2025-12-04T09:34:47.7024995Z IOMMU Support:: None 2025-12-04T09:34:47.7025459Z Pool Info: 2025-12-04T09:34:47.7025807Z Pool 1 2025-12-04T09:34:47.7026246Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T09:34:47.7026776Z Size: 20955136(0x13fc000) KB 2025-12-04T09:34:47.7027291Z Allocatable: TRUE 2025-12-04T09:34:47.7027825Z Alloc Granule: 4KB 2025-12-04T09:34:47.7028397Z Alloc Recommended Granule:2048KB 2025-12-04T09:34:47.7028970Z Alloc Alignment: 4KB 2025-12-04T09:34:47.7029524Z Accessible by all: FALSE 2025-12-04T09:34:47.7029999Z Pool 2 2025-12-04T09:34:47.7030427Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T09:34:47.7030936Z Size: 20955136(0x13fc000) KB 2025-12-04T09:34:47.7031443Z Allocatable: TRUE 2025-12-04T09:34:47.7031973Z Alloc Granule: 4KB 2025-12-04T09:34:47.7035203Z Alloc Recommended Granule:2048KB 2025-12-04T09:34:47.7035797Z Alloc Alignment: 4KB 2025-12-04T09:34:47.7036346Z Accessible by all: FALSE 2025-12-04T09:34:47.7036831Z Pool 3 2025-12-04T09:34:47.7037223Z Segment: GROUP 2025-12-04T09:34:47.7037699Z Size: 64(0x40) KB 2025-12-04T09:34:47.7038192Z Allocatable: FALSE 2025-12-04T09:34:47.7038723Z Alloc Granule: 0KB 2025-12-04T09:34:47.7039280Z Alloc Recommended Granule:0KB 2025-12-04T09:34:47.7039840Z Alloc Alignment: 0KB 2025-12-04T09:34:47.7040389Z Accessible by all: FALSE 2025-12-04T09:34:47.7040861Z ISA Info: 2025-12-04T09:34:47.7041200Z ISA 1 2025-12-04T09:34:47.7041610Z Name: amdgcn-amd-amdhsa--gfx1100 2025-12-04T09:34:47.7042156Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T09:34:47.7042917Z Profiles: HSA_PROFILE_BASE 2025-12-04T09:34:47.7043473Z Default Rounding Mode: NEAR 2025-12-04T09:34:47.7044037Z Default Rounding Mode: NEAR 2025-12-04T09:34:47.7044560Z Fast f16: TRUE 2025-12-04T09:34:47.7045084Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:34:47.7045588Z Workgroup Max Size per Dimension: 2025-12-04T09:34:47.7046032Z x 1024(0x400) 2025-12-04T09:34:47.7046573Z y 1024(0x400) 2025-12-04T09:34:47.7047000Z z 1024(0x400) 2025-12-04T09:34:47.7047501Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:34:47.7047980Z Grid Max Size per Dimension: 2025-12-04T09:34:47.7048387Z x 2147483647(0x7fffffff) 2025-12-04T09:34:47.7048823Z y 65535(0xffff) 2025-12-04T09:34:47.7049267Z z 65535(0xffff) 2025-12-04T09:34:47.7049790Z FBarrier Max Size: 32 2025-12-04T09:34:47.7050261Z ISA 2 2025-12-04T09:34:47.7050700Z Name: amdgcn-amd-amdhsa--gfx11-generic 2025-12-04T09:34:47.7051253Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T09:34:47.7051806Z Profiles: HSA_PROFILE_BASE 2025-12-04T09:34:47.7052360Z Default Rounding Mode: NEAR 2025-12-04T09:34:47.7052933Z Default Rounding Mode: NEAR 2025-12-04T09:34:47.7053460Z Fast f16: TRUE 2025-12-04T09:34:47.7053993Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:34:47.7054495Z Workgroup Max Size per Dimension: 2025-12-04T09:34:47.7054923Z x 1024(0x400) 2025-12-04T09:34:47.7055356Z y 1024(0x400) 2025-12-04T09:34:47.7055783Z z 1024(0x400) 2025-12-04T09:34:47.7056263Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:34:47.7056744Z Grid Max Size per Dimension: 2025-12-04T09:34:47.7057145Z x 2147483647(0x7fffffff) 2025-12-04T09:34:47.7057852Z y 65535(0xffff) 2025-12-04T09:34:47.7058294Z z 65535(0xffff) 2025-12-04T09:34:47.7058789Z FBarrier Max Size: 32 2025-12-04T09:34:47.7059269Z *** Done *** 2025-12-04T09:34:47.7183036Z + rocminfo 2025-12-04T09:34:47.7183455Z + grep -E 'Name:.*\sgfx|Marketing' 2025-12-04T09:34:47.7945313Z Marketing Name: AMD Ryzen Threadripper PRO 5975WX 32-Cores 2025-12-04T09:34:47.7946056Z Name: gfx1100 2025-12-04T09:34:47.7946627Z Marketing Name: 2025-12-04T09:34:47.8145376Z + MAYBE_ROCM=rocm/ 2025-12-04T09:34:47.8145850Z + [[ linux-jammy-rocm-py3.10 == *xpu* ]] 2025-12-04T09:34:47.8146420Z + [[ linux-jammy-rocm-py3.10 != *-bazel-* ]] 2025-12-04T09:34:47.8146915Z + pip_install ninja==1.10.2 2025-12-04T09:34:47.8147494Z + pip_install_pkg='python3 -m pip install --progress-bar off' 2025-12-04T09:34:47.8148170Z + python3 -m pip install --progress-bar off ninja==1.10.2 2025-12-04T09:34:48.1582454Z Collecting ninja==1.10.2 2025-12-04T09:34:48.2590211Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl.metadata (5.0 kB) 2025-12-04T09:34:48.2910035Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl (108 kB) 2025-12-04T09:34:48.5891139Z Installing collected packages: ninja 2025-12-04T09:34:48.5891505Z Attempting uninstall: ninja 2025-12-04T09:34:48.5896900Z Found existing installation: ninja 1.11.1.4 2025-12-04T09:34:48.5913730Z Uninstalling ninja-1.11.1.4: 2025-12-04T09:34:48.5979628Z Successfully uninstalled ninja-1.11.1.4 2025-12-04T09:34:48.6210174Z Successfully installed ninja-1.10.2 2025-12-04T09:34:48.6618467Z + export PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:34:48.6621461Z + PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:34:48.6623193Z + [[ linux-jammy-rocm-py3.10 == *aarch64* ]] 2025-12-04T09:34:48.6623722Z + [[ linux-jammy-rocm-py3.10 == *asan* ]] 2025-12-04T09:34:48.6624227Z + [[ linux-jammy-rocm-py3.10 == *-debug* ]] 2025-12-04T09:34:48.6624716Z + [[ linux-jammy-rocm-py3.10 != *-bazel-* ]] 2025-12-04T09:34:48.6625429Z + echo 'We are not in debug mode: linux-jammy-rocm-py3.10. Expect the assertion to pass' 2025-12-04T09:34:48.6626333Z We are not in debug mode: linux-jammy-rocm-py3.10. Expect the assertion to pass 2025-12-04T09:34:48.6626970Z + cd test 2025-12-04T09:34:48.6627474Z + python -c 'import torch; torch._C._crash_if_debug_asserts_fail(424242)' 2025-12-04T09:34:49.8141225Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X\2 ]] 2025-12-04T09:34:49.8142101Z + [[ default == \n\o\g\p\u\_\A\V\X\5\1\2 ]] 2025-12-04T09:34:49.8142845Z + [[ default == \l\e\g\a\c\y\_\n\v\i\d\i\a\_\d\r\i\v\e\r ]] 2025-12-04T09:34:49.8146718Z + DYNAMO_BENCHMARK_FLAGS=() 2025-12-04T09:34:49.8147134Z + [[ default == *pr_time_benchmarks* ]] 2025-12-04T09:34:49.8147501Z + [[ default == *dynamo_eager* ]] 2025-12-04T09:34:49.8147739Z + [[ default == *aot_eager* ]] 2025-12-04T09:34:49.8147958Z + [[ default == *aot_inductor* ]] 2025-12-04T09:34:49.8148186Z + [[ default == *max_autotune_inductor* ]] 2025-12-04T09:34:49.8148427Z + [[ default == *inductor* ]] 2025-12-04T09:34:49.8148640Z + [[ default == *dynamic* ]] 2025-12-04T09:34:49.8148847Z + [[ default == *cpu* ]] 2025-12-04T09:34:49.8149047Z + [[ default == *xpu* ]] 2025-12-04T09:34:49.8149274Z + DYNAMO_BENCHMARK_FLAGS+=(--device cuda) 2025-12-04T09:34:49.8168326Z + [[ linux-jammy-rocm-py3.10 == *libtorch* ]] 2025-12-04T09:34:49.8169099Z + [[ linux-jammy-rocm-py3.10 == *-bazel-* ]] 2025-12-04T09:34:49.8172677Z + cd test 2025-12-04T09:34:49.8173532Z + python -c 'import torch; print(torch.__config__.show())' 2025-12-04T09:34:50.8020432Z PyTorch built with: 2025-12-04T09:34:50.8021106Z - GCC 11.4 2025-12-04T09:34:50.8021640Z - C++ Version: 201703 2025-12-04T09:34:50.8022492Z - Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications 2025-12-04T09:34:50.8023548Z - Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d) 2025-12-04T09:34:50.8024203Z - OpenMP 201511 (a.k.a. OpenMP 4.5) 2025-12-04T09:34:50.8024725Z - LAPACK is enabled (usually provided by MKL) 2025-12-04T09:34:50.8025208Z - NNPACK is enabled 2025-12-04T09:34:50.8025592Z - CPU capability usage: AVX2 2025-12-04T09:34:50.8026005Z - HIP Runtime 7.1.25424 2025-12-04T09:34:50.8026378Z - MIOpen 3.5.1 2025-12-04T09:34:50.8026712Z - Magma 2.9.0 2025-12-04T09:34:50.8033104Z - Build settings: BLAS_INFO=mkl, BUILD_TYPE=Release, COMMIT_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32, CXX_COMPILER=/opt/cache/bin/c++, CXX_FLAGS= -fvisibility-inlines-hidden -DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO -DLIBKINETO_NOCUPTI -DLIBKINETO_NOXPUPTI=ON -DUSE_FBGEMM -DUSE_FBGEMM_GENAI -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE -O2 -fPIC -DC10_NODEPRECATED -Wall -Wextra -Werror=return-type -Werror=non-virtual-dtor -Werror=range-loop-construct -Werror=bool-operation -Wnarrowing -Wno-missing-field-initializers -Wno-unknown-pragmas -Wno-unused-parameter -Wno-strict-overflow -Wno-strict-aliasing -Wno-stringop-overflow -Wsuggest-override -Wno-psabi -Wno-error=old-style-cast -faligned-new -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, LAPACK_INFO=mkl, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, TORCH_VERSION=2.10.0, USE_CUDA=OFF, USE_CUDNN=OFF, USE_CUSPARSELT=OFF, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_GLOO=ON, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=OFF, USE_NCCL=ON, USE_NNPACK=ON, USE_OPENMP=ON, USE_ROCM=ON, USE_ROCM_KERNEL_ASSERT=OFF, USE_XCCL=OFF, USE_XPU=OFF, 2025-12-04T09:34:50.8040189Z 2025-12-04T09:34:51.0480868Z + cd test 2025-12-04T09:34:51.0481319Z + python -c 'import torch; print(torch.__config__.parallel_info())' 2025-12-04T09:34:51.9633119Z ATen/Parallel: 2025-12-04T09:34:51.9633624Z at::get_num_threads() : 32 2025-12-04T09:34:51.9634091Z at::get_num_interop_threads() : 32 2025-12-04T09:34:51.9634557Z OpenMP 201511 (a.k.a. OpenMP 4.5) 2025-12-04T09:34:51.9634995Z omp_get_max_threads() : 32 2025-12-04T09:34:51.9635844Z Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications 2025-12-04T09:34:51.9636694Z mkl_get_max_threads() : 32 2025-12-04T09:34:51.9637267Z Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d) 2025-12-04T09:34:51.9637913Z std::thread::hardware_concurrency() : 64 2025-12-04T09:34:51.9638382Z Environment variables: 2025-12-04T09:34:51.9638763Z OMP_NUM_THREADS : [not set] 2025-12-04T09:34:51.9639187Z MKL_NUM_THREADS : [not set] 2025-12-04T09:34:51.9639606Z ATen parallel backend: OpenMP 2025-12-04T09:34:51.9639884Z 2025-12-04T09:34:52.1822071Z + [[ default == *numpy_2* ]] 2025-12-04T09:34:52.1822380Z + [[ linux-jammy-rocm-py3.10 == *aarch64* ]] 2025-12-04T09:34:52.1822666Z + [[ default == *backward* ]] 2025-12-04T09:34:52.1822917Z + [[ default == *libtorch_agnostic_targetting* ]] 2025-12-04T09:34:52.1823183Z + [[ default == *xla* ]] 2025-12-04T09:34:52.1823387Z + [[ default == *vllm* ]] 2025-12-04T09:34:52.1823590Z + [[ default == *executorch* ]] 2025-12-04T09:34:52.1823822Z + [[ default == \j\i\t\_\l\e\g\a\c\y ]] 2025-12-04T09:34:52.1824063Z + [[ default == \q\u\a\n\t\i\z\a\t\i\o\n ]] 2025-12-04T09:34:52.1824322Z + [[ linux-jammy-rocm-py3.10 == *libtorch* ]] 2025-12-04T09:34:52.1824583Z + [[ default == distributed ]] 2025-12-04T09:34:52.1824805Z + [[ default == *operator_benchmark* ]] 2025-12-04T09:34:52.1825057Z + [[ default == *operator_microbenchmark* ]] 2025-12-04T09:34:52.1825597Z + [[ default == *attention_microbenchmark* ]] 2025-12-04T09:34:52.1825863Z + [[ default == *inductor_distributed* ]] 2025-12-04T09:34:52.1826116Z + [[ default == *inductor-halide* ]] 2025-12-04T09:34:52.1826363Z + [[ default == *inductor-pallas* ]] 2025-12-04T09:34:52.1826616Z + [[ default == *inductor-triton-cpu* ]] 2025-12-04T09:34:52.1826886Z + [[ default == *inductor-micro-benchmark* ]] 2025-12-04T09:34:52.1827175Z + [[ default == *aoti_cross_compile_for_windows* ]] 2025-12-04T09:34:52.1827446Z + [[ default == *huggingface* ]] 2025-12-04T09:34:52.1827663Z + [[ default == *timm* ]] 2025-12-04T09:34:52.1827865Z + [[ default == cachebench ]] 2025-12-04T09:34:52.1828081Z + [[ default == verify_cachebench ]] 2025-12-04T09:34:52.1828307Z + [[ default == *torchbench* ]] 2025-12-04T09:34:52.1828535Z + [[ default == *inductor_cpp_wrapper* ]] 2025-12-04T09:34:52.1828778Z + [[ default == *inductor_core* ]] 2025-12-04T09:34:52.1829005Z + [[ default == *inductor* ]] 2025-12-04T09:34:52.1829221Z + [[ default == *einops* ]] 2025-12-04T09:34:52.1829432Z + [[ default == *dynamo_core* ]] 2025-12-04T09:34:52.1829655Z + [[ default == *dynamo_wrapped* ]] 2025-12-04T09:34:52.1829889Z + [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:34:52.1831063Z + [[ -n test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor inductor/test_kernel_benchmark inductor/test_pad_mm inductor/test_benchmark_fusion inductor/test_aot_inductor inductor/test_torchinductor inductor/test_decompose_mem_bound_mm inductor/test_flex_attention inductor/test_max_autotune ]] 2025-12-04T09:34:52.1832365Z + install_torchvision 2025-12-04T09:34:52.1832557Z + local orig_preload 2025-12-04T09:34:52.1832739Z + local commit 2025-12-04T09:34:52.1832930Z ++ get_pinned_commit vision 2025-12-04T09:34:52.1833159Z ++ cat .github/ci_commit_pins/vision.txt 2025-12-04T09:34:52.1837596Z + commit=617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:34:52.1837902Z + orig_preload= 2025-12-04T09:34:52.1838115Z + '[' -n '' ']' 2025-12-04T09:34:52.1838326Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:34:52.1838836Z + pip_build_and_install git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e dist/vision 2025-12-04T09:34:52.1839526Z + local build_target=git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:34:52.1839979Z + local wheel_dir=dist/vision 2025-12-04T09:34:52.1840188Z + local found_whl=0 2025-12-04T09:34:52.1840394Z + for file in "${wheel_dir}"/*.whl 2025-12-04T09:34:52.1840624Z + [[ -f dist/vision/*.whl ]] 2025-12-04T09:34:52.1840824Z + '[' 0 == 0 ']' 2025-12-04T09:34:52.1841338Z + python3 -m pip wheel --no-build-isolation --no-deps -w dist/vision git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:34:52.4027890Z Collecting git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:34:52.4030135Z Cloning https://github.com/pytorch/vision.git (to revision 617079d944b0e72632311c30ae2bbdf1168b901e) to /tmp/pip-req-build-lhfhzaxa 2025-12-04T09:34:52.4067575Z Running command git clone --filter=blob:none --quiet https://github.com/pytorch/vision.git /tmp/pip-req-build-lhfhzaxa 2025-12-04T09:34:56.4510388Z Running command git rev-parse -q --verify 'sha^617079d944b0e72632311c30ae2bbdf1168b901e' 2025-12-04T09:34:56.4547553Z Running command git fetch -q https://github.com/pytorch/vision.git 617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:34:56.7573215Z Resolved https://github.com/pytorch/vision.git to commit 617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:34:59.4357100Z Preparing metadata (pyproject.toml) ... [?25l- \ | / done 2025-12-04T09:34:59.4383571Z [?25hBuilding wheels for collected packages: torchvision 2025-12-04T09:36:03.5555031Z Building wheel for torchvision (pyproject.toml) ... [?25l- \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-12-04T09:36:03.5579385Z [?25h Created wheel for torchvision: filename=torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl size=1808994 sha256=760dcfa71ba13768b692fdc752a194b367a7a0a2d8ae51fb4e41c01feb80ac82 2025-12-04T09:36:03.5580371Z Stored in directory: /var/lib/jenkins/.cache/pip/wheels/12/b2/29/1f82685c5b5173629e1f36a9b93989ce92ce563e5fb91d27ac 2025-12-04T09:36:03.5606027Z Successfully built torchvision 2025-12-04T09:36:03.6355155Z + for file in "${wheel_dir}"/*.whl 2025-12-04T09:36:03.6355733Z + pip_install_whl dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:36:03.6356247Z + args=('dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl') 2025-12-04T09:36:03.6356601Z + local args 2025-12-04T09:36:03.6356919Z + [[ dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl == *\ * ]] 2025-12-04T09:36:03.6357307Z + for path in "${args[@]}" 2025-12-04T09:36:03.6357702Z + echo 'Installing dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl' 2025-12-04T09:36:03.6358215Z Installing dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:36:03.6358803Z + python3 -mpip install --no-index --no-deps dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:36:03.8593263Z Processing ./dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:36:03.8659213Z Installing collected packages: torchvision 2025-12-04T09:36:04.1605072Z Successfully installed torchvision-0.25.0a0+617079d 2025-12-04T09:36:04.1828861Z + '[' -n '' ']' 2025-12-04T09:36:04.1829429Z + test_python_shard 2 2025-12-04T09:36:04.1830217Z + [[ -z 2 ]] 2025-12-04T09:36:04.1833413Z + python test/run_test.py --exclude-jit-executor --exclude-distributed-tests --exclude-quantization-tests --include test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor inductor/test_kernel_benchmark inductor/test_pad_mm inductor/test_benchmark_fusion inductor/test_aot_inductor inductor/test_torchinductor inductor/test_decompose_mem_bound_mm inductor/test_flex_attention inductor/test_max_autotune --shard 2 2 --verbose --upload-artifacts-while-running 2025-12-04T09:36:06.3871724Z Excluding inductor/test_max_autotune on ROCm 2025-12-04T09:36:07.0343051Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/pytorch/test/.pytorch-disabled-tests.json 2025-12-04T09:36:07.1966837Z Ignoring disabled issues: [''] 2025-12-04T09:36:07.2033019Z Found test times from artifacts 2025-12-04T09:36:07.2285532Z Found test times from artifacts 2025-12-04T09:36:07.2293503Z Running all tests 2025-12-04T09:36:07.2298538Z Running parallel tests on 1 processes 2025-12-04T09:36:07.2299125Z Name: tests to run (est. time: 96.97min) 2025-12-04T09:36:07.2299590Z Serial tests (16): 2025-12-04T09:36:07.2299989Z inductor/test_aot_inductor 2/4 2025-12-04T09:36:07.2300438Z inductor/test_aot_inductor 4/4 2025-12-04T09:36:07.2300893Z inductor/test_torchinductor 2/2 2025-12-04T09:36:07.2301364Z inductor/test_flex_attention 1/6 2025-12-04T09:36:07.2301801Z inductor/test_flex_attention 3/6 2025-12-04T09:36:07.2302225Z inductor/test_flex_attention 5/6 2025-12-04T09:36:07.2302663Z inductor/test_pad_mm 1/1 2025-12-04T09:36:07.2303090Z inductor/test_benchmark_fusion 1/1 2025-12-04T09:36:07.2303576Z inductor/test_decompose_mem_bound_mm 1/1 2025-12-04T09:36:07.2304039Z test_torch 1/1 2025-12-04T09:36:07.2304367Z test_ops 2/7 2025-12-04T09:36:07.2304689Z test_ops 4/7 2025-12-04T09:36:07.2305002Z test_ops 6/7 2025-12-04T09:36:07.2305304Z test_nn 1/1 2025-12-04T09:36:07.2305633Z test_binary_ufuncs 1/1 2025-12-04T09:36:07.2306017Z test_unary_ufuncs 1/1 2025-12-04T09:36:07.2306393Z Parallel tests (0): 2025-12-04T09:36:07.2306771Z Name: excluded (est. time: 0.0min) 2025-12-04T09:36:07.2307181Z Serial tests (0): 2025-12-04T09:36:07.2307523Z Parallel tests (0): 2025-12-04T09:36:07.2308660Z Running inductor/test_aot_inductor 2/4 ... [2025-12-04 09:36:07.229836][210658.489217115] 2025-12-04T09:36:07.2309384Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T09:36:07.2310906Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '--shard-id=2', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:36:07.230089] 2025-12-04T09:43:36.8418388Z 2025-12-04T09:43:36.8419213Z inductor/test_aot_inductor 2/4 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_aot_inductor_2.4_0b612f3777922a41_.log 2025-12-04T09:43:36.8498237Z Running 227 items in this shard: test/inductor/test_aot_inductor.py::AOTInductorLoggingTest::test_shape_env_reuse, test/inductor/test_aot_inductor.py::TestAOTInductorConfig::test_no_compile_standalone, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_m_32_n_64_q_group_64_num_groups_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_32_num_groups_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_amp_fallback_random_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aot_inductor_consts_cpp_build_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_fp8_dtype_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_sym_inputs_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_user_defined_triton_kernel_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_profiler_enable_kernel_profile_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_autotuning_args_reuse_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_backward_no_op_logging_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_bmm_multiple_dynamic_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_buffer_mutation_3_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_cpu_predicate_cuda_operands_max_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_cpu_predicate_cuda_operands_max_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_share_predicate_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_parameters_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_constant_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_dup_unbacked_sym_decl_with_refinement_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fill__fallback_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fp8_view_of_param_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fx_gm_return_tuple_validation_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_dynamic_dim_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_mmaped_weights_on_disk_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_libtorch_free_so_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misaligned_input_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misc_1_max_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_missing_output_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_non_tensor_input_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_none_args_aot_codegen_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_output_path_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_proxy_executor_hann_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_repeat_output_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_return_view_constant_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_runtime_checks_shape_failed_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scaled_grouped_mm_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_simple_dynamic_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_simple_split_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_size_with_unbacked_add_expr_transitive_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_so_without_weight_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_subclasses_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sym_expr_indexing_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sym_i64_input_codegen_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_symbool_item_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_autotuning_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_dynamic_launcher_grid_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_equal_to_1_arg_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_equal_to_1_float_arg_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_True_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_False_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_on_device_tma_dynamic_False_tma_version_new_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_new_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_old_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_with_none_inputs_and_equal_to_1_arg_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_0_use_static_size_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_2_use_static_size_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_3_use_static_size_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_3_use_static_size_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_update_user_managed_buffer_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_weight_on_disk_legacy_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_nested_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_unbacked_symint_closure_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_with_offset_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_with_profiler_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_grid_with_unbacked_symbols_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_size_buffer_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_64_num_groups_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_addmm_multiple_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_constant_tensor_name_collision_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_assert_tensor_meta_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_autotune_with_constant_folding_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_bool_input_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_boolean_indexing_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_codegen_int_array_var_fix_memory_leak_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_composed_dynamic_size_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_symint_input_disable_one_pass_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_with_parameters_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_constant_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_constant_folding_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_conv3d_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_conv_freezing_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_copy_non_blocking_is_pinned_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_custom_op_in_subgraph_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_dynamic_cat_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_dynamic_scalar_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_embedding_bag_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_empty_cat_dtype_promotion_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fft_c2c_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_input_codegen_with_sympy_expr_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_issue_140766_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_missing_cubin_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_mixed_device_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_model_modified_weights_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_multi_device_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_nan_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_no_args_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_non_contiguous_output_alias_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_non_tensor_input_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_none_args_aot_codegen_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_normal_functional_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_repeated_calling_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_reuse_kernel_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_rocm_triton_autotuning_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_fp8_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_same_backing_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_scaled_grouped_mm_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sdpa_2_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sdpa_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_shifted_constraint_ranges_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_multi_arch_embed_kernel_binary_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_small_constant_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_stft_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sym_i64_input_codegen_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_symfloat_item_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sympy_cpp_printer_min_max_minmax0_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_autotuning_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_bool_param_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_equal_to_1_arg_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_equal_to_1_float_arg_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_False_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_on_device_tma_dynamic_False_tma_version_new_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_on_device_tma_dynamic_True_tma_version_old_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_reinterpret_view_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_tma_descriptor_1d_dynamic_False_tma_version_new_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_tma_descriptor_1d_dynamic_True_tma_version_new_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_tma_descriptor_2d_dynamic_True_tma_version_old_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_expr_replacements_shift_k_2_use_static_size_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_expr_replacements_shift_k_3_use_static_size_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_upper_bound_i64_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_weight_on_disk_legacy_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_nested_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_conv_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_mixed_device_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_sym_expr_cond_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_with_profiler_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_zero_size_buffer_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_addmm_multiple_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aliased_buffer_reuse_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_debug_printer_sym_inputs_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_assert_tensor_meta_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_autotune_with_constant_folding_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_backward_no_op_logging_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_bool_input_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_boolean_indexing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_buffer_mutation_and_force_mmap_weights_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_composed_dynamic_size_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_cpu_predicate_cuda_operands_max_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_symint_input_disable_one_pass_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_unbacked_symint_closure_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_conv3d_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_conv_freezing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_convolution_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_device_moved_constant_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_duplicate_constant_folding_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_dynamic_cat_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_dynamic_scalar_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_embedding_bag_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_empty_cat_dtype_promotion_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_empty_graph_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fake_tensor_device_validation_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fallback_kernel_with_symexpr_output_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fill__fallback_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fp8_view_of_param_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_free_inactive_buffer_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_freezing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_inf_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_issue_140766_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_linear_dynamic_maxautotune_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misc_1_max_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_missing_cubin_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_model_modified_weights_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_multiple_output_alias_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_narrow_fallback_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_output_path_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_output_path_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_poi_multiple_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_profile_benchmark_harness_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_quantized_linear_bias_none_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_repeat_interleave_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_repeated_calling_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_repeated_user_defined_triton_kernel_embed_kernel_binary_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_return_constant_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_reuse_kernel_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_run_with_grad_enabled_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_sdpa_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_seq_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_simple_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_size_from_multi_output_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_size_with_unbacked_add_expr_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_size_with_unbacked_add_expr_transitive_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_symint_item_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_dynamic_launcher_grid_infer_from_tensor_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_dynamic_grid_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_extern_kernel_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_sympy_expr_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_sympy_fn_like_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_0_use_static_size_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_0_use_static_size_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_1_use_static_size_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_2_use_static_size_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_simple_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_conv_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_sym_expr_cond_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_with_profiler_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_zero_grid_with_unbacked_symbols_mps 2025-12-04T09:43:36.8572173Z 2025-12-04T09:43:36.8572417Z Finished inductor/test_aot_inductor 2/4 ... [2025-12-04 09:43:36.841860][211108.10123913], took 7.49min 2025-12-04T09:43:36.8573228Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-77ca8ebcee473ba9.xml 2025-12-04T09:43:38.9319811Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T09:43:38.9320351Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T09:43:38.9320737Z Uploading artifacts took 0.00 seconds 2025-12-04T09:43:38.9321111Z Running inductor/test_aot_inductor 4/4 ... [2025-12-04 09:43:38.931495][211110.190871989] 2025-12-04T09:43:38.9321496Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T09:43:38.9322326Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '--shard-id=4', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:43:38.931735] 2025-12-04T09:51:38.8676157Z 2025-12-04T09:51:38.8676971Z inductor/test_aot_inductor 4/4 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_aot_inductor_4.4_0aeda9b9a007adc4_.log 2025-12-04T09:51:38.8755913Z Running 237 items in this shard: test/inductor/test_aot_inductor.py::TestAOTInductorConfig::test_compile_standalone_cross_compile_windows_package_format, test/inductor/test_aot_inductor.py::TestAOTInductorConfig::test_compile_standalone_explicit_set, test/inductor/test_aot_inductor.py::TestAOTInductorConfig::test_compile_standalone_sets_package_cpp, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__int_mm_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_32_num_groups_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_constant_tensor_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_codegen_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printing_model_inputs_codegen_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_profiler_enable_kernel_profile_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_autotune_with_constant_folding_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_buffer_mutation_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_buffer_mutation_and_force_mmap_weights_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_mismatched_branch_output_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_non_tensor_predicates_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_symint_input_disable_one_pass_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_unbacked_symint_closure_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_use_buffers_from_outer_scope_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_outer_code_before_after_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_reinterpret_view_inputs_outputs_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_consecutive_compiles_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_constant_folding_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_conv3d_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_conv_freezing_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_copy_non_blocking_is_pinned_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_dynamic_cat_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_dynamic_scalar_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_empty_cat_dtype_promotion_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_empty_constant_folding_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fallback_kernel_with_symexpr_output_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fft_c2c_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_foreach_multiple_dynamic_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fp8_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_freezing_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_index_put_fallback_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_index_put_with_none_index_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_input_codegen_with_sympy_expr_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_issue_140766_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_grid_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_linear_dynamic_maxautotune_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_mixed_device_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_model_modified_weights_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_multi_device_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_nested_tensor_from_jagged_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_non_default_gpu_device_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_normal_functional_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_on_gpu_device1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_output_path_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_pad_fallback_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_profile_benchmark_harness_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_proxy_executor_abs_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_quantized_linear_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_repeat_interleave_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_repeated_user_defined_triton_kernel_embed_kernel_binary_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_replicate_on_devices_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_return_constant_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_reuse_kernel_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_run_with_grad_enabled_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_same_backing_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scaled_dot_product_efficient_attention_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scatter_fallback_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sdpa_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sdpa_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_seq_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_stride_with_unbacked_expr_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_torchvision_transforms_functional_tensor_resize_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_dynamic_launcher_grid_infer_from_tensor_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_dynamic_shape_with_div_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_reinterpret_view_mem_leak_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_1d_dynamic_False_tma_version_old_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_2d_dynamic_True_tma_version_old_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_1_use_static_size_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_2_use_static_size_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_update_constant_buffer_simple_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_using_model_name_for_files_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_sym_expr_cond_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_sym_expr_cond_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_32_num_groups_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_64_num_groups_2_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_debug_printer_cpp_kernel_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_runtime_asserts_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_autotuning_args_reuse_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_bmm_multiple_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_buffer_mutation_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_buffer_mutation_and_force_mmap_weights_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_clamp_decomposition_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_cpu_predicate_cuda_operands_max_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_mismatched_branch_output_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_share_predicate_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_simple_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_symint_input_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_unbacked_symint_closure_dynamic_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_unbacked_symint_closure_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_convolution_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_d2h_copy_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_deconv_freezing_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_dup_unbacked_sym_decl_with_refinement_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_duplicate_constant_folding_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_extract_constants_map_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fake_tensor_device_validation_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fallback_mem_leak_fix_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fp8_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fp8_view_of_param_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_inf_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_int_list_input_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_large_grid_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_large_mmaped_weights_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_large_mmaped_weights_on_disk_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_large_weight_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_masked_select_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_misaligned_input_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_misaligned_input_2_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_output_path_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_pad_non_zero_memory_leak_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_proxy_executor_hann_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_proxy_executor_squeeze_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_quantized_linear_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_repeated_user_defined_triton_kernel_embed_kernel_binary_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_replace_unbacked_symbol_with_backed_expr_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_return_constant_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_reuse_kernel_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_run_with_grad_enabled_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_device_type_failed_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_large_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_scatter_fallback_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_scatter_reduce_fallback_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_seq_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_embed_kernel_binary_True_max_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_from_multi_output_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_transitive_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_so_without_weight_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sym_expr_indexing_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sympy_cpp_printer_min_max_minmax1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_torchvision_transforms_functional_tensor_resize_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_dynamic_grid_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_dynamic_shape_with_div_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_on_device_tma_dynamic_False_tma_version_old_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_sympy_fn_like_arg_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_new_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_expr_replacements_shift_k_1_use_static_size_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_update_constant_buffer_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_mixed_device_dynamic_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_parameters_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_unbacked_symint_closure_dynamic_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_unbacked_symint_closure_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_with_offset_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_zero_grid_with_unbacked_symbols_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test__int_mm_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_64_num_groups_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_add_complex_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_constant_tensor_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_constant_tensor_name_collision_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_debug_printing_model_inputs_codegen_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_profiler_enable_kernel_profile_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_runtime_asserts_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_autotune_int64_user_defined_triton_kernel_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_bmm_multiple_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_buffer_mutation_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_buffer_mutation_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_buffer_mutation_3_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_mismatched_branch_output_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_mismatched_branch_output_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_non_tensor_predicates_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_predicate_on_cpu_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_use_buffers_from_outer_scope_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_with_parameters_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_with_reinterpret_view_inputs_outputs_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_d2h_copy_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_duplicated_params_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_index_put_fallback_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_index_put_with_none_index_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_int_list_input_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_large_mmaped_weights_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_large_weight_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_libtorch_free_so_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misaligned_input_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misaligned_input_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misc_1_max_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_nan_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_non_default_gpu_device_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_on_gpu_device1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_quanatized_int8_linear_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_return_view_constant_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_device_type_failed_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_dtype_failed_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_large_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_same_backing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_scatter_reduce_fallback_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_shifted_constraint_ranges_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_subclasses_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_sym_expr_indexing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_sym_i64_input_codegen_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_symbool_item_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_equal_to_1_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_equal_to_1_float_arg_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_multi_output_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_on_device_tma_dynamic_False_tma_version_old_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_reinterpret_view_mem_leak_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_1d_dynamic_True_tma_version_new_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_new_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_1_use_static_size_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_update_inactive_constant_buffer_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_update_user_managed_buffer_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_nested_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_conv_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_outer_code_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_pytree_inputs_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_sym_expr_cond_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_unbacked_symint_closure_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_unbacked_symint_closure_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_with_offset_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_zero_grid_with_backed_symbols_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_zero_size_buffer_mps 2025-12-04T09:51:38.8833441Z 2025-12-04T09:51:38.8833683Z Finished inductor/test_aot_inductor 4/4 ... [2025-12-04 09:51:38.867789][211590.127169111], took 8.00min 2025-12-04T09:51:38.8834500Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-77ca8ebcee473ba9.xml 2025-12-04T09:51:38.8835263Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T09:51:38.8835728Z Running inductor/test_torchinductor 2/2 ... [2025-12-04 09:51:38.874590][211590.133971528] 2025-12-04T09:51:38.8836114Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T09:51:38.8836921Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor.py', '--shard-id=2', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:51:38.874809] 2025-12-04T10:01:44.4876513Z 2025-12-04T10:01:44.4877295Z PRINTING LOG FILE of inductor/test_torchinductor 2/2 (test/test-reports/inductor.test_torchinductor_2.2_61b015180443276d_.log) 2025-12-04T10:01:44.4878403Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-610411f96626e4cf.xml 2025-12-04T10:01:44.4879574Z ============================= test session starts ============================== 2025-12-04T10:01:44.4880179Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:01:44.4880708Z cachedir: .pytest_cache 2025-12-04T10:01:44.4881321Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:01:44.4881995Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:01:44.4882302Z configfile: pytest.ini 2025-12-04T10:01:44.4882978Z plugins: hypothesis-6.56.4, xdoctest-1.3.0, subtests-0.13.1, rerunfailures-14.0, xdist-3.3.1, flakefinder-1.1.0, cpp-2.3.0, typeguard-4.3.0 2025-12-04T10:01:44.4883687Z collecting ... collected 999 items 2025-12-04T10:01:44.4884075Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T10:01:44.5060525Z Running 524 items in this shard: test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_transposed, test/inductor/test_torchinductor.py::GPUTests::test__dyn_quant_matmul_4bit_bf16_input_cuda, test/inductor/test_torchinductor.py::GPUTests::test__dyn_quant_matmul_4bit_fp32_input_cuda, test/inductor/test_torchinductor.py::GPUTests::test__dyn_quant_pack_4bit_weight_fp32_cuda, test/inductor/test_torchinductor.py::GPUTests::test__unsafe_masked_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool2d_low_prec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool_errors_with_long_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_max_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex7_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex_strided_fallback_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_const_int_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adding_tensor_offsets_cuda, test/inductor/test_torchinductor.py::GPUTests::test_addmv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aliased_buffer_reuse_cuda, test/inductor/test_torchinductor.py::GPUTests::test_allow_reuse_disable_if_exceed_peak_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aoti_eager_cache_hit_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aoti_eager_dtype_device_layout_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aoti_eager_override_registration_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aoti_eager_support_out_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aoti_eager_support_str_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_argmin3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_argmin_with_duplicates_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_argmin_with_nan_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_min_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_to_float_cuda, test/inductor/test_torchinductor.py::GPUTests::test_as_strided_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_assert_alignment_op_name_pass_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool3d_backward2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool3d_backward3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool3d_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool_errors_with_uint_cuda, test/inductor/test_torchinductor.py::GPUTests::test_baddbmm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bernoulli1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bernoulli2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bfloat16_to_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bitwise3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bitwise_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bmm1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bool_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_broadcast_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int16_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int16_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int16_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int16_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int16_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int32_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int32_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int8_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int8_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int8_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_uint8_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_uint8_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_nd_tiling_False_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_nd_tiling_True_cuda, test/inductor/test_torchinductor.py::GPUTests::test_buffer_batch_norm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_buffer_copied_in_graph_cuda, test/inductor/test_torchinductor.py::GPUTests::test_buffer_copied_in_graph_with_different_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_buffer_use_after_remove_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_pos_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_zero_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_int_ndigits_zero_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_extern_kernel_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_inplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_negative_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_of_loops_and_extern_kernel_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_unbacked_2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_unbacked_legacy_empty_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_upcasting_cuda, test/inductor/test_torchinductor.py::GPUTests::test_check_stack_no_cycles_cuda, test/inductor/test_torchinductor.py::GPUTests::test_clamp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_config_option_dont_assume_alignment_cuda, test/inductor/test_torchinductor.py::GPUTests::test_consecutive_split_cumsum_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_1d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv1d_depthwise_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv1d_with_permute_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv3d_channels_last_use_block_ptr_True_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_bn_fuse_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_functional_bn_fuse_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_inference_heuristics_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_with_as_strided_cuda, test/inductor/test_torchinductor.py::GPUTests::test_convolution2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_convolution4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_convolution5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cos_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cpu_scalar_with_gpu_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cpu_scalar_with_gpu_tensor_dynamic_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cpu_tensor_with_gpu_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_no_mask_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_zero_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_default_layout_constraint_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_fixed_layout_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_fixed_layout_sequential_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_scan_op_compiled_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_scan_op_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_scan_op_multi_input_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_scan_would_split_cuda, test/inductor/test_torchinductor.py::GPUTests::test_deterministic_codegen_cuda, test/inductor/test_torchinductor.py::GPUTests::test_diagonal_copy_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dist_bf16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dist_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div9_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_prim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_softmax_symfloat_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_zero_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout_deterministic_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout_trivial_0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtype_sympy_expr_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_bfloat16_bfloat16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_bfloat16_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_bfloat16_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_bfloat16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_bfloat16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_embedding_cuda, test/inductor/test_torchinductor.py::GPUTests::test_empty1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_empty2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_emulate_precision_triton_fp_fusion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_erfc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_erfinv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_exact_stride_cuda, test/inductor/test_torchinductor.py::GPUTests::test_expm1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_list_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_no_mutated_tensors_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fft_real_input_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fill2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_flexible_layout_immutable_free_symbols_cuda, test/inductor/test_torchinductor.py::GPUTests::test_flip_cat_cuda, test/inductor/test_torchinductor.py::GPUTests::test_flip_cuda, test/inductor/test_torchinductor.py::GPUTests::test_float_index_expression_cuda, test/inductor/test_torchinductor.py::GPUTests::test_float_repr_dynamic_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fmin_fmax_cuda, test/inductor/test_torchinductor.py::GPUTests::test_forced_buffer_realize_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_full_like_transposed_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fuse_tiled_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gather2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gather_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_getitem_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gpu_scalar_with_cpu_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_arange1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_constant_tensor2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_misaligned_input_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_no_inputs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_unbacked_symint_as_output_cuda, test/inductor/test_torchinductor.py::GPUTests::test_hardsigmoid_cuda, test/inductor/test_torchinductor.py::GPUTests::test_hardswish_cuda, test/inductor/test_torchinductor.py::GPUTests::test_horizonal_fusion2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_dynamic_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_propagation_abs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_propagation_device_assert_masked_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_propagation_nested_indirect_indexing_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_deterministic_fallback_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_reinplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_remainder_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_select_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inductor_multiple_specializations_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inf_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inner_fn_str_and_stride_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inner_reduction_detection_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_activations_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_add_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_flip_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_insignificant_strides_cuda, test/inductor/test_torchinductor.py::GPUTests::test_isinf_cuda, test/inductor/test_torchinductor.py::GPUTests::test_issue102546_cuda, test/inductor/test_torchinductor.py::GPUTests::test_kernel_names_cuda, test/inductor/test_torchinductor.py::GPUTests::test_kwargs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_l1_loss_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_broadcast_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_grid_use_block_ptr_True_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_pointwise_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_strided_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_tensor_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_leaky_relu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lgamma_cuda, test/inductor/test_torchinductor.py::GPUTests::test_like_rands3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_like_rands_sliced_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linalg_eig_stride_consistency_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linear1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linear2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linear_mixed_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linspace1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linspace2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linspace4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lite_mode_fallback_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lite_regional_compile_flex_attention_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lite_regional_compile_repeated_blocks_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lite_triton_kernel_wrapper_functional_cuda, test/inductor/test_torchinductor.py::GPUTests::test_log_fp64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_log_softmax_cuda, test/inductor/test_torchinductor.py::GPUTests::test_logaddexp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_logcumsumexp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_logcumsumexp_zero_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_low_memory_max_pool_dilation_2_dim_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_low_memory_max_pool_dilation_2_dim_3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_masked_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_matmul_layer_norm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_min_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_min_max_reduction_nan_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mix_device_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mixed_mm2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mixed_mm3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mixed_mm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mm_mixed_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mm_views_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mul_index_expr_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mul_softmax_symfloat_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multi_gpu_device_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multi_gpu_recompile_on_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_var_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_var_lowp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mutable_custom_op_fixed_layout2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mutable_custom_op_fixed_layout_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nan_sort_stable_False_descending_False_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nan_sort_stable_False_descending_True_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nan_to_num_cuda, test/inductor/test_torchinductor.py::GPUTests::test_neg_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_neg_max_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nll_loss_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nll_loss_forward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_no_mega_fusion_during_lowering_cuda, test/inductor/test_torchinductor.py::GPUTests::test_no_op_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_no_specization_over_symbolic_value_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nonzero_unbacked_refinement_cuda, test/inductor/test_torchinductor.py::GPUTests::test_norm_constant_overflow_cuda, test/inductor/test_torchinductor.py::GPUTests::test_output_strides_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pattern_matcher_multi_user_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pattern_matcher_unbacked_cuda, test/inductor/test_torchinductor.py::GPUTests::test_permute1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_permute2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_j0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_j1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_u_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_v_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_w_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_digamma_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_entr_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_erfc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_expit_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_expm1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_gammainc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_gammaincc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_hermite_polynomial_he_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_i0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_i0e_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_i1e_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_logit_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_k0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_ndtr_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_polygamma_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_scaled_modified_bessel_k1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_t_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_u_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_sinc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_xlog1py_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_xlogy_cuda, test/inductor/test_torchinductor.py::GPUTests::test_polar_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow_int_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow_symfloat_cuda, test/inductor/test_torchinductor.py::GPUTests::test_prepare_softmax_with_fast_math_cuda, test/inductor/test_torchinductor.py::GPUTests::test_profiler_mark_wrapper_call_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randint_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randint_distribution_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randint_int64_mod_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randn_generator_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randn_like_empty_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction_config_limit_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reflection_pad2d_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reinterpret_dtypeview_cuda, test/inductor/test_torchinductor.py::GPUTests::test_relu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_no_ops_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_noop_slice_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_noop_slice_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_noop_view_default_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_noop_view_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_Tensor_decomp_int32_nd_1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_replication_pad_errors_with_bool_cuda, test/inductor/test_torchinductor.py::GPUTests::test_require_stride_expanded_cuda, test/inductor/test_torchinductor.py::GPUTests::test_resize_as_cuda, test/inductor/test_torchinductor.py::GPUTests::test_resize_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reuse_buffers_with_aliasing_cuda, test/inductor/test_torchinductor.py::GPUTests::test_roll_cuda, test/inductor/test_torchinductor.py::GPUTests::test_rsqrt_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scalar_output_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scaled_dot_product_attention_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_add1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_add3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_reduce1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_False_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sdpa_unaligned_mask_freezing_cuda, test/inductor/test_torchinductor.py::GPUTests::test_searchsorted_cuda, test/inductor/test_torchinductor.py::GPUTests::test_select_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sgn_extremal_cuda, test/inductor/test_torchinductor.py::GPUTests::test_shape_padding_cuda, test/inductor/test_torchinductor.py::GPUTests::test_should_pad_bench_for_bmm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sigmoid_cuda, test/inductor/test_torchinductor.py::GPUTests::test_signbit_cuda, test/inductor/test_torchinductor.py::GPUTests::test_silu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sin_cuda, test/inductor/test_torchinductor.py::GPUTests::test_single_elem_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_mutation1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_mutation2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_mutation3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter_dtype_consistency_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter_reinplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_softmax_backward_data_cuda, test/inductor/test_torchinductor.py::GPUTests::test_softmax_one_kernel_loop_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sort_bool_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sort_transpose_cuda, test/inductor/test_torchinductor.py::GPUTests::test_special_polygamma_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cumprod_low_prec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cumsum_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cumsum_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cumsum_low_prec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_reduction_dynamic_shape_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_with_integer_cuda, test/inductor/test_torchinductor.py::GPUTests::test_squeeze2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_squeeze_varargs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_std_cuda, test/inductor/test_torchinductor.py::GPUTests::test_stride_preservation_with_stride_modifying_fx_pass_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tensor2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tensor_index_slice_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tmp_not_defined_issue1_use_block_ptr_True_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tmp_not_defined_issue2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_to_device_constant_cuda, test/inductor/test_torchinductor.py::GPUTests::test_to_device_cuda, test/inductor/test_torchinductor.py::GPUTests::test_to_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_to_memory_format_cuda, test/inductor/test_torchinductor.py::GPUTests::test_transposed_propagates_cuda, test/inductor/test_torchinductor.py::GPUTests::test_triton_argmin_argmax_transpose_logical_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_triu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_uint4x2_mixed_mm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unbacked_floordiv_simplify_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unbacked_floordiv_simplify_errors_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unbind_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unfold_zero_dimension_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unsigned_constant_tensors_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_bfloat16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_bilinear2d_a_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_bilinear2d_b_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_nearest1d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_nearest2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_nearest3d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_var_correction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_var_mean_tile_reduction_False_cuda, test/inductor/test_torchinductor.py::GPUTests::test_var_mean_tile_reduction_True_cuda, test/inductor/test_torchinductor.py::GPUTests::test_vectorized_ops_masked_cuda, test/inductor/test_torchinductor.py::GPUTests::test_vectorized_ops_masked_var_novec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_as_complex_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_detach_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_on_aliased_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_uint8_through_differing_bitwidths_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views7_cuda, test/inductor/test_torchinductor.py::GPUTests::test_weight_norm_bwd_cuda, test/inductor/test_torchinductor.py::GPUTests::test_xblock_divides_xnumel_cuda, test/inductor/test_torchinductor.py::GPUTests::test_zero_dim_reductions_cuda, test/inductor/test_torchinductor.py::GPUTests::test_zero_element_mutation_cuda, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_cant_optimize_compute, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_ctr_not_moved_to_cuda_when_used_in_index_put, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_evict_last_non_coalesced_loads_block_ptr, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_graph_partition_default_device_context, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_grouped_mm, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_has_constant_mask_block_multiple_False_ynumel_exceed_ygrid_size_False, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_has_constant_mask_block_multiple_True_ynumel_exceed_ygrid_size_False, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_has_constant_mask_block_multiple_True_ynumel_exceed_ygrid_size_True, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_inductor_detach_view_backend_inductor, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_inductor_sequence_nr, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_not_materialize_pointwise_reduction, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_numpy_on_gpu, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_compute, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_assert, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_dtype_with_constraint, test/inductor/test_torchinductor.py::RNNTest::test_rnn_compile_safe, test/inductor/test_torchinductor.py::NanCheckerTest::test_nan_checker_pass 2025-12-04T10:01:44.5183142Z 2025-12-04T10:01:44.5183516Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_broadcast1 PASSED [1.3323s] [ 0%] 2025-12-04T10:01:44.5184204Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_broadcast3 PASSED [0.1025s] [ 0%] 2025-12-04T10:01:44.5184786Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_dense PASSED [0.1125s] [ 0%] 2025-12-04T10:01:44.5185308Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_double PASSED [0.1075s] [ 0%] 2025-12-04T10:01:44.5185800Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_int PASSED [0.1052s] [ 0%] 2025-12-04T10:01:44.5186292Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_broadcast1 PASSED [0.1172s] [ 1%] 2025-12-04T10:01:44.5186801Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_broadcast3 PASSED [0.1007s] [ 1%] 2025-12-04T10:01:44.5187287Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_dense PASSED [0.1191s] [ 1%] 2025-12-04T10:01:44.5187767Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_strided PASSED [0.1384s] [ 1%] 2025-12-04T10:01:44.5188257Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_broadcast1 PASSED [0.1004s] [ 1%] 2025-12-04T10:01:44.5188749Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_broadcast2 PASSED [0.1009s] [ 2%] 2025-12-04T10:01:44.5189515Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_broadcast3 PASSED [0.0971s] [ 2%] 2025-12-04T10:01:44.5190012Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_double PASSED [0.1065s] [ 2%] 2025-12-04T10:01:44.5190502Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_strided PASSED [0.3982s] [ 2%] 2025-12-04T10:01:44.5190996Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_transposed PASSED [0.4150s] [ 2%] 2025-12-04T10:01:44.5191485Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_broadcast2 PASSED [0.4009s] [ 3%] 2025-12-04T10:01:44.5191949Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_int PASSED [0.3829s] [ 3%] 2025-12-04T10:01:44.5192418Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_transposed PASSED [0.4306s] [ 3%] 2025-12-04T10:01:44.5192902Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_broadcast1 PASSED [0.3944s] [ 3%] 2025-12-04T10:01:44.5193451Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_broadcast3 PASSED [0.4082s] [ 3%] 2025-12-04T10:01:44.5193927Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_dense PASSED [0.4058s] [ 4%] 2025-12-04T10:01:44.5194551Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_broadcast1 PASSED [0.4198s] [ 4%] 2025-12-04T10:01:44.5195017Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_broadcast2 PASSED [0.4199s] [ 4%] 2025-12-04T10:01:44.5195472Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_dense PASSED [0.5779s] [ 4%] 2025-12-04T10:01:44.5195922Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_double PASSED [0.4243s] [ 4%] 2025-12-04T10:01:44.5196369Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_int PASSED [0.4108s] [ 4%] 2025-12-04T10:01:44.5196834Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_broadcast2 PASSED [0.4317s] [ 5%] 2025-12-04T10:01:44.5197394Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_dense PASSED [0.4189s] [ 5%] 2025-12-04T10:01:44.5197865Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_double PASSED [0.4134s] [ 5%] 2025-12-04T10:01:44.5198331Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_int PASSED [0.4237s] [ 5%] 2025-12-04T10:01:44.5198808Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_broadcast1 PASSED [0.4214s] [ 5%] 2025-12-04T10:01:44.5199312Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_broadcast3 PASSED [0.4351s] [ 6%] 2025-12-04T10:01:44.5199797Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_dense PASSED [0.4216s] [ 6%] 2025-12-04T10:01:44.5200281Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_strided PASSED [0.4172s] [ 6%] 2025-12-04T10:01:44.5200775Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_transposed PASSED [0.4080s] [ 6%] 2025-12-04T10:01:44.5201384Z inductor/test_torchinductor.py::GPUTests::test__dyn_quant_matmul_4bit_bf16_input_cuda SKIPPED [0.0018s] (No _dyn_quant_matmul_4bit implementation on CUDA) [ 6%] 2025-12-04T10:01:44.5202091Z inductor/test_torchinductor.py::GPUTests::test__dyn_quant_matmul_4bit_fp32_input_cuda SKIPPED [0.0015s] (No _dyn_quant_matmul_4bit implementation on CUDA) [ 7%] 2025-12-04T10:01:44.5202818Z inductor/test_torchinductor.py::GPUTests::test__dyn_quant_pack_4bit_weight_fp32_cuda SKIPPED [0.0015s] (No _dyn_quant_pack_4bit_weight implementation on CUDA) [ 7%] 2025-12-04T10:01:44.5203407Z inductor/test_torchinductor.py::GPUTests::test__unsafe_masked_index_cuda PASSED [0.9380s] [ 7%] 2025-12-04T10:01:44.5203848Z inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool2d2_cuda PASSED [0.0710s] [ 7%] 2025-12-04T10:01:44.5204306Z inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool2d_low_prec_cuda PASSED [1.1483s] [ 7%] 2025-12-04T10:01:44.5204899Z inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool_errors_with_long_cuda PASSED [0.4920s] [ 8%] 2025-12-04T10:01:44.5205374Z inductor/test_torchinductor.py::GPUTests::test_adaptive_max_pool2d2_cuda PASSED [0.0707s] [ 8%] 2025-12-04T10:01:44.5205798Z inductor/test_torchinductor.py::GPUTests::test_add_complex3_cuda PASSED [0.6392s] [ 8%] 2025-12-04T10:01:44.5206250Z inductor/test_torchinductor.py::GPUTests::test_add_complex7_cuda PASSED [0.4702s] [ 8%] 2025-12-04T10:01:44.5206648Z inductor/test_torchinductor.py::GPUTests::test_add_complex_cuda PASSED [0.4799s] [ 8%] 2025-12-04T10:01:44.5207085Z inductor/test_torchinductor.py::GPUTests::test_add_complex_strided_fallback_cuda PASSED [0.0650s] [ 8%] 2025-12-04T10:01:44.5207527Z inductor/test_torchinductor.py::GPUTests::test_add_const_int_cuda PASSED [0.9484s] [ 9%] 2025-12-04T10:01:44.5207953Z inductor/test_torchinductor.py::GPUTests::test_adding_tensor_offsets_cuda PASSED [0.1461s] [ 9%] 2025-12-04T10:01:44.5208369Z inductor/test_torchinductor.py::GPUTests::test_addmv_cuda PASSED [0.5892s] [ 9%] 2025-12-04T10:01:44.5208776Z inductor/test_torchinductor.py::GPUTests::test_aliased_buffer_reuse_cuda PASSED [0.3373s] [ 9%] 2025-12-04T10:01:44.5209246Z inductor/test_torchinductor.py::GPUTests::test_allow_reuse_disable_if_exceed_peak_cuda PASSED [0.5757s] [ 9%] 2025-12-04T10:01:44.5209996Z inductor/test_torchinductor.py::GPUTests::test_aoti_eager_cache_hit_cuda W1204 09:52:02.016000 100822 site-packages/torch/_export/__init__.py:71] +============================+ 2025-12-04T10:01:44.5210595Z W1204 09:52:02.017000 100822 site-packages/torch/_export/__init__.py:72] | !!! WARNING !!! | 2025-12-04T10:01:44.5211026Z W1204 09:52:02.017000 100822 site-packages/torch/_export/__init__.py:73] +============================+ 2025-12-04T10:01:44.5211879Z W1204 09:52:02.017000 100822 site-packages/torch/_export/__init__.py:74] torch._export.aot_compile()/torch._export.aot_load() is being deprecated, please switch to directly calling torch._inductor.aoti_compile_and_package(torch.export.export())/torch._inductor.aoti_load_package() instead. 2025-12-04T10:01:44.5212631Z PASSED [3.8118s] [ 10%] 2025-12-04T10:01:44.5212948Z inductor/test_torchinductor.py::GPUTests::test_aoti_eager_dtype_device_layout_cuda PASSED [3.2097s] [ 10%] 2025-12-04T10:01:44.5213457Z inductor/test_torchinductor.py::GPUTests::test_aoti_eager_override_registration_cuda PASSED [16.2379s] [ 10%] 2025-12-04T10:01:44.5213938Z inductor/test_torchinductor.py::GPUTests::test_aoti_eager_support_out_cuda PASSED [3.7346s] [ 10%] 2025-12-04T10:01:44.5214428Z inductor/test_torchinductor.py::GPUTests::test_aoti_eager_support_str_cuda PASSED [6.2765s] [ 10%] 2025-12-04T10:01:44.5214862Z inductor/test_torchinductor.py::GPUTests::test_arange3_cuda PASSED [0.2114s] [ 11%] 2025-12-04T10:01:44.5215247Z inductor/test_torchinductor.py::GPUTests::test_arange4_cuda PASSED [0.2162s] [ 11%] 2025-12-04T10:01:44.5215626Z inductor/test_torchinductor.py::GPUTests::test_arange5_cuda PASSED [0.1832s] [ 11%] 2025-12-04T10:01:44.5216015Z inductor/test_torchinductor.py::GPUTests::test_argmax_argmin3_cuda PASSED [4.7951s] [ 11%] 2025-12-04T10:01:44.5216464Z inductor/test_torchinductor.py::GPUTests::test_argmax_argmin_with_duplicates_cuda PASSED [1.3948s] [ 11%] 2025-12-04T10:01:44.5216930Z inductor/test_torchinductor.py::GPUTests::test_argmax_argmin_with_nan_cuda PASSED [2.4236s] [ 12%] 2025-12-04T10:01:44.5217359Z inductor/test_torchinductor.py::GPUTests::test_argmax_min_int32_cuda PASSED [0.1756s] [ 12%] 2025-12-04T10:01:44.5217776Z inductor/test_torchinductor.py::GPUTests::test_argmax_to_float_cuda PASSED [0.4768s] [ 12%] 2025-12-04T10:01:44.5218194Z inductor/test_torchinductor.py::GPUTests::test_as_strided_scatter_cuda PASSED [0.3036s] [ 12%] 2025-12-04T10:01:44.5218652Z inductor/test_torchinductor.py::GPUTests::test_assert_alignment_op_name_pass_cuda PASSED [0.0018s] [ 12%] 2025-12-04T10:01:44.5219093Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d1_cuda PASSED [0.4905s] [ 12%] 2025-12-04T10:01:44.5219605Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d2_cuda PASSED [0.5033s] [ 13%] 2025-12-04T10:01:44.5219997Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d3_cuda PASSED [0.8988s] [ 13%] 2025-12-04T10:01:44.5220389Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d5_cuda PASSED [0.6822s] [ 13%] 2025-12-04T10:01:44.5220800Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward2_cuda PASSED [6.0101s] [ 13%] 2025-12-04T10:01:44.5221234Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward3_cuda PASSED [1.0165s] [ 13%] 2025-12-04T10:01:44.5221661Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward4_cuda PASSED [0.0710s] [ 14%] 2025-12-04T10:01:44.5222088Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward_cuda PASSED [0.9698s] [ 14%] 2025-12-04T10:01:44.5222668Z inductor/test_torchinductor.py::GPUTests::test_avg_pool3d_backward2_cuda SKIPPED [0.0003s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 14%] 2025-12-04T10:01:44.5223234Z inductor/test_torchinductor.py::GPUTests::test_avg_pool3d_backward3_cuda PASSED [1.5907s] [ 14%] 2025-12-04T10:01:44.5223664Z inductor/test_torchinductor.py::GPUTests::test_avg_pool3d_backward_cuda PASSED [1.7128s] [ 14%] 2025-12-04T10:01:44.5224475Z inductor/test_torchinductor.py::GPUTests::test_avg_pool_errors_with_uint_cuda E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:01:44.5225257Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] Traceback (most recent call last): 2025-12-04T10:01:44.5225995Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5226708Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] r = func(*args, **kwargs) 2025-12-04T10:01:44.5227350Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5228009Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5228728Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:01:44.5229398Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] torch._check( 2025-12-04T10:01:44.5230029Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5230791Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5231568Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5232254Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] raise error_type(message_evaluated) 2025-12-04T10:01:44.5232842Z E1204 09:52:59.453000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] RuntimeError: "avg_pool2d" not implemented for 'torch.uint8' 2025-12-04T10:01:44.5233491Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:01:44.5234184Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] Traceback (most recent call last): 2025-12-04T10:01:44.5234923Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5235637Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] r = func(*args, **kwargs) 2025-12-04T10:01:44.5236275Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5236930Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5237643Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:01:44.5238318Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] torch._check( 2025-12-04T10:01:44.5238940Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5239793Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5240558Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5241230Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] raise error_type(message_evaluated) 2025-12-04T10:01:44.5241820Z E1204 09:52:59.461000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] RuntimeError: "avg_pool2d" not implemented for 'torch.uint16' 2025-12-04T10:01:44.5242457Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:01:44.5243039Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] Traceback (most recent call last): 2025-12-04T10:01:44.5243760Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5244461Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] r = func(*args, **kwargs) 2025-12-04T10:01:44.5245113Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5245761Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5246610Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:01:44.5247281Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] torch._check( 2025-12-04T10:01:44.5247900Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5248651Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5249552Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5250241Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] raise error_type(message_evaluated) 2025-12-04T10:01:44.5250833Z E1204 09:52:59.468000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] RuntimeError: "avg_pool2d" not implemented for 'torch.uint32' 2025-12-04T10:01:44.5251481Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:01:44.5252061Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] Traceback (most recent call last): 2025-12-04T10:01:44.5252792Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5253492Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] r = func(*args, **kwargs) 2025-12-04T10:01:44.5254124Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5254889Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5255596Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:01:44.5256264Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] torch._check( 2025-12-04T10:01:44.5256891Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5257651Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5258418Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5259089Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] raise error_type(message_evaluated) 2025-12-04T10:01:44.5259673Z E1204 09:52:59.475000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] RuntimeError: "avg_pool2d" not implemented for 'torch.uint64' 2025-12-04T10:01:44.5260311Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:01:44.5260887Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] Traceback (most recent call last): 2025-12-04T10:01:44.5261603Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5262311Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] r = func(*args, **kwargs) 2025-12-04T10:01:44.5262951Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5263598Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5264404Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:01:44.5265083Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] torch._check( 2025-12-04T10:01:44.5265702Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5266451Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5267211Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5267886Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] raise error_type(message_evaluated) 2025-12-04T10:01:44.5268475Z E1204 09:52:59.497000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] RuntimeError: "avg_pool2d" not implemented for 'torch.uint8' 2025-12-04T10:01:44.5269201Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:01:44.5269778Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] Traceback (most recent call last): 2025-12-04T10:01:44.5270495Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5271198Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] r = func(*args, **kwargs) 2025-12-04T10:01:44.5271834Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5272479Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5273191Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:01:44.5273859Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] torch._check( 2025-12-04T10:01:44.5274473Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5275234Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5276004Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5276680Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] raise error_type(message_evaluated) 2025-12-04T10:01:44.5277270Z E1204 09:52:59.506000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] RuntimeError: "avg_pool2d" not implemented for 'torch.uint16' 2025-12-04T10:01:44.5277912Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:01:44.5278491Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] Traceback (most recent call last): 2025-12-04T10:01:44.5279301Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5280016Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] r = func(*args, **kwargs) 2025-12-04T10:01:44.5280651Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5281300Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5282010Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:01:44.5282682Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] torch._check( 2025-12-04T10:01:44.5283303Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5284140Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5284905Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5285580Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] raise error_type(message_evaluated) 2025-12-04T10:01:44.5286210Z E1204 09:52:59.516000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] RuntimeError: "avg_pool2d" not implemented for 'torch.uint32' 2025-12-04T10:01:44.5286848Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:01:44.5287431Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] Traceback (most recent call last): 2025-12-04T10:01:44.5288145Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5288842Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] r = func(*args, **kwargs) 2025-12-04T10:01:44.5289474Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5290129Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5290831Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:01:44.5291500Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] torch._check( 2025-12-04T10:01:44.5292116Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5292867Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5293759Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5294434Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] raise error_type(message_evaluated) 2025-12-04T10:01:44.5295027Z E1204 09:52:59.525000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] RuntimeError: "avg_pool2d" not implemented for 'torch.uint64' 2025-12-04T10:01:44.5295660Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] failed while attempting to run meta for aten.avg_pool3d.default 2025-12-04T10:01:44.5296236Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] Traceback (most recent call last): 2025-12-04T10:01:44.5296951Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5297656Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] r = func(*args, **kwargs) 2025-12-04T10:01:44.5298302Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5299067Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5299756Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T10:01:44.5300433Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] result = fn(*args, **kwargs) 2025-12-04T10:01:44.5301139Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 3001, in meta_avg_pool3d 2025-12-04T10:01:44.5301807Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] torch._check( 2025-12-04T10:01:44.5302430Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5303185Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5303950Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5304634Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] raise error_type(message_evaluated) 2025-12-04T10:01:44.5305234Z E1204 09:52:59.542000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] RuntimeError: "avg_pool3d" not implemented for 'torch.uint8' 2025-12-04T10:01:44.5305869Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] failed while attempting to run meta for aten.avg_pool3d.default 2025-12-04T10:01:44.5306451Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] Traceback (most recent call last): 2025-12-04T10:01:44.5307165Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5307872Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] r = func(*args, **kwargs) 2025-12-04T10:01:44.5308600Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5309260Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5309949Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T10:01:44.5310627Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] result = fn(*args, **kwargs) 2025-12-04T10:01:44.5311327Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 3001, in meta_avg_pool3d 2025-12-04T10:01:44.5311998Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] torch._check( 2025-12-04T10:01:44.5312618Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5313377Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5314232Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5314914Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] raise error_type(message_evaluated) 2025-12-04T10:01:44.5315498Z E1204 09:52:59.552000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] RuntimeError: "avg_pool3d" not implemented for 'torch.uint16' 2025-12-04T10:01:44.5316139Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] failed while attempting to run meta for aten.avg_pool3d.default 2025-12-04T10:01:44.5316725Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] Traceback (most recent call last): 2025-12-04T10:01:44.5317455Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5318158Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] r = func(*args, **kwargs) 2025-12-04T10:01:44.5318797Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5319455Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5320142Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T10:01:44.5320837Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] result = fn(*args, **kwargs) 2025-12-04T10:01:44.5321541Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 3001, in meta_avg_pool3d 2025-12-04T10:01:44.5322216Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] torch._check( 2025-12-04T10:01:44.5322840Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5323688Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5324460Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5325155Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] raise error_type(message_evaluated) 2025-12-04T10:01:44.5325743Z E1204 09:52:59.562000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] RuntimeError: "avg_pool3d" not implemented for 'torch.uint32' 2025-12-04T10:01:44.5326423Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] failed while attempting to run meta for aten.avg_pool3d.default 2025-12-04T10:01:44.5327017Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] Traceback (most recent call last): 2025-12-04T10:01:44.5327738Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:01:44.5328560Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] r = func(*args, **kwargs) 2025-12-04T10:01:44.5329197Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:01:44.5329849Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] return self._op(*args, **kwargs) 2025-12-04T10:01:44.5330541Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T10:01:44.5331222Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] result = fn(*args, **kwargs) 2025-12-04T10:01:44.5331936Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 3001, in meta_avg_pool3d 2025-12-04T10:01:44.5332618Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] torch._check( 2025-12-04T10:01:44.5333243Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:01:44.5334003Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:01:44.5334780Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:01:44.5335461Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] raise error_type(message_evaluated) 2025-12-04T10:01:44.5336056Z E1204 09:52:59.572000 100822 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] RuntimeError: "avg_pool3d" not implemented for 'torch.uint64' 2025-12-04T10:01:44.5336473Z PASSED [0.1301s] [ 15%] 2025-12-04T10:01:44.5336750Z inductor/test_torchinductor.py::GPUTests::test_baddbmm_cuda PASSED [0.5057s] [ 15%] 2025-12-04T10:01:44.5337145Z inductor/test_torchinductor.py::GPUTests::test_bernoulli1_cuda PASSED [0.7937s] [ 15%] 2025-12-04T10:01:44.5337536Z inductor/test_torchinductor.py::GPUTests::test_bernoulli2_cuda PASSED [0.7341s] [ 15%] 2025-12-04T10:01:44.5338063Z inductor/test_torchinductor.py::GPUTests::test_bfloat16_to_int16_cuda PASSED [0.1266s] [ 15%] 2025-12-04T10:01:44.5338475Z inductor/test_torchinductor.py::GPUTests::test_bitwise3_cuda PASSED [0.1877s] [ 16%] 2025-12-04T10:01:44.5338857Z inductor/test_torchinductor.py::GPUTests::test_bitwise_cuda PASSED [0.1988s] [ 16%] 2025-12-04T10:01:44.5339240Z inductor/test_torchinductor.py::GPUTests::test_bmm1_cuda PASSED [0.2818s] [ 16%] 2025-12-04T10:01:44.5339613Z inductor/test_torchinductor.py::GPUTests::test_bool_cuda PASSED [0.3378s] [ 16%] 2025-12-04T10:01:44.5340019Z inductor/test_torchinductor.py::GPUTests::test_bucketize_broadcast_cuda PASSED [0.2195s] [ 16%] 2025-12-04T10:01:44.5340477Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int16_int16_cuda PASSED [0.5688s] [ 16%] 2025-12-04T10:01:44.5340936Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int16_int32_cuda PASSED [0.5520s] [ 17%] 2025-12-04T10:01:44.5341388Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int16_int64_cuda PASSED [0.5867s] [ 17%] 2025-12-04T10:01:44.5341844Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int16_int8_cuda PASSED [0.5927s] [ 17%] 2025-12-04T10:01:44.5342295Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int16_uint8_cuda PASSED [0.6339s] [ 17%] 2025-12-04T10:01:44.5342744Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int32_int64_cuda PASSED [0.5841s] [ 17%] 2025-12-04T10:01:44.5343295Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int32_int8_cuda PASSED [0.5567s] [ 18%] 2025-12-04T10:01:44.5343741Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int8_int16_cuda PASSED [0.5707s] [ 18%] 2025-12-04T10:01:44.5344185Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int8_int32_cuda PASSED [0.8392s] [ 18%] 2025-12-04T10:01:44.5344628Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int8_int64_cuda PASSED [0.6108s] [ 18%] 2025-12-04T10:01:44.5345084Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_uint8_int16_cuda PASSED [0.6311s] [ 18%] 2025-12-04T10:01:44.5345542Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_uint8_uint8_cuda PASSED [0.5977s] [ 19%] 2025-12-04T10:01:44.5345994Z inductor/test_torchinductor.py::GPUTests::test_bucketize_nd_tiling_False_cuda PASSED [0.2427s] [ 19%] 2025-12-04T10:01:44.5346451Z inductor/test_torchinductor.py::GPUTests::test_bucketize_nd_tiling_True_cuda PASSED [0.2426s] [ 19%] 2025-12-04T10:01:44.5346890Z inductor/test_torchinductor.py::GPUTests::test_buffer_batch_norm_cuda PASSED [0.9089s] [ 19%] 2025-12-04T10:01:44.5347326Z inductor/test_torchinductor.py::GPUTests::test_buffer_copied_in_graph_cuda PASSED [0.1563s] [ 19%] 2025-12-04T10:01:44.5347823Z inductor/test_torchinductor.py::GPUTests::test_buffer_copied_in_graph_with_different_shapes_cuda PASSED [0.4502s] [ 20%] 2025-12-04T10:01:44.5348322Z inductor/test_torchinductor.py::GPUTests::test_buffer_use_after_remove_cuda PASSED [1.4130s] [ 20%] 2025-12-04T10:01:44.5348757Z inductor/test_torchinductor.py::GPUTests::test_builtins_round_cuda PASSED [0.1740s] [ 20%] 2025-12-04T10:01:44.5349214Z inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_pos_cuda PASSED [0.1360s] [ 20%] 2025-12-04T10:01:44.5349709Z inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_zero_cuda PASSED [0.1210s] [ 20%] 2025-12-04T10:01:44.5350229Z inductor/test_torchinductor.py::GPUTests::test_builtins_round_int_ndigits_zero_cuda PASSED [0.1033s] [ 20%] 2025-12-04T10:01:44.5350697Z inductor/test_torchinductor.py::GPUTests::test_cat_extern_kernel_cuda PASSED [0.1445s] [ 21%] 2025-12-04T10:01:44.5351111Z inductor/test_torchinductor.py::GPUTests::test_cat_inplace_cuda PASSED [0.5096s] [ 21%] 2025-12-04T10:01:44.5351522Z inductor/test_torchinductor.py::GPUTests::test_cat_negative_dim_cuda PASSED [0.5853s] [ 21%] 2025-12-04T10:01:44.5351976Z inductor/test_torchinductor.py::GPUTests::test_cat_of_loops_and_extern_kernel_cuda PASSED [0.6522s] [ 21%] 2025-12-04T10:01:44.5352410Z inductor/test_torchinductor.py::GPUTests::test_cat_uint8_cuda PASSED [0.1786s] [ 21%] 2025-12-04T10:01:44.5352905Z inductor/test_torchinductor.py::GPUTests::test_cat_unbacked_2d_cuda PASSED [0.4277s] [ 22%] 2025-12-04T10:01:44.5353346Z inductor/test_torchinductor.py::GPUTests::test_cat_unbacked_legacy_empty_cuda PASSED [0.0151s] [ 22%] 2025-12-04T10:01:44.5353788Z inductor/test_torchinductor.py::GPUTests::test_cat_upcasting_cuda PASSED [0.2572s] [ 22%] 2025-12-04T10:01:44.5354213Z inductor/test_torchinductor.py::GPUTests::test_check_stack_no_cycles_cuda PASSED [0.0986s] [ 22%] 2025-12-04T10:01:44.5354626Z inductor/test_torchinductor.py::GPUTests::test_clamp_cuda PASSED [0.2765s] [ 22%] 2025-12-04T10:01:44.5355083Z inductor/test_torchinductor.py::GPUTests::test_config_option_dont_assume_alignment_cuda PASSED [0.4023s] [ 23%] 2025-12-04T10:01:44.5355572Z inductor/test_torchinductor.py::GPUTests::test_consecutive_split_cumsum_cuda PASSED [0.2940s] [ 23%] 2025-12-04T10:01:44.5356016Z inductor/test_torchinductor.py::GPUTests::test_constant_pad_1d_cuda PASSED [0.2824s] [ 23%] 2025-12-04T10:01:44.5356430Z inductor/test_torchinductor.py::GPUTests::test_constant_pad_2d_cuda PASSED [0.3107s] [ 23%] 2025-12-04T10:01:44.5356857Z inductor/test_torchinductor.py::GPUTests::test_constant_pad_float64_cuda PASSED [0.1710s] [ 23%] 2025-12-04T10:01:44.5357287Z inductor/test_torchinductor.py::GPUTests::test_conv1d_depthwise_cuda PASSED [0.3122s] [ 24%] 2025-12-04T10:01:44.5357807Z inductor/test_torchinductor.py::GPUTests::test_conv1d_with_permute_cuda PASSED [0.1467s] [ 24%] 2025-12-04T10:01:44.5358374Z inductor/test_torchinductor.py::GPUTests::test_conv3d_channels_last_use_block_ptr_True_cuda SKIPPED [0.0018s] (only support cpu conv3d channels_last) [ 24%] 2025-12-04T10:01:44.5358927Z inductor/test_torchinductor.py::GPUTests::test_conv_backward_cuda PASSED [0.5046s] [ 24%] 2025-12-04T10:01:44.5359461Z inductor/test_torchinductor.py::GPUTests::test_conv_bn_fuse_cuda SKIPPED [0.0018s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 24%] 2025-12-04T10:01:44.5360091Z inductor/test_torchinductor.py::GPUTests::test_conv_functional_bn_fuse_cuda SKIPPED [0.0017s] (only support cpu conv bn test) [ 25%] 2025-12-04T10:01:44.5360759Z inductor/test_torchinductor.py::GPUTests::test_conv_inference_heuristics_cuda SKIPPED [0.0017s] (skipIfRocm: test doesn't currently work on the ROCm stack) [ 25%] 2025-12-04T10:01:44.5361477Z inductor/test_torchinductor.py::GPUTests::test_conv_with_as_strided_cuda SKIPPED [0.0016s] (skipIfRocm: test doesn't currently work on the ROCm stack) [ 25%] 2025-12-04T10:01:44.5362024Z inductor/test_torchinductor.py::GPUTests::test_convolution2_cuda PASSED [0.1576s] [ 25%] 2025-12-04T10:01:44.5362426Z inductor/test_torchinductor.py::GPUTests::test_convolution4_cuda PASSED [2.9307s] [ 25%] 2025-12-04T10:01:44.5363106Z inductor/test_torchinductor.py::GPUTests::test_convolution5_cuda MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 9600, provided ptr: 0 size: 0 2025-12-04T10:01:44.5363889Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 9600, provided ptr: 0 size: 0 2025-12-04T10:01:44.5364521Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 4800, provided ptr: 0 size: 0 2025-12-04T10:01:44.5365157Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 4800, provided ptr: 0 size: 0 2025-12-04T10:01:44.5365552Z PASSED [0.2814s] [ 25%] 2025-12-04T10:01:44.5366063Z inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda W1204 09:53:23.894000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5366834Z W1204 09:53:23.895000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5367290Z W1204 09:53:23.895000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5367857Z W1204 09:53:23.895000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5368313Z W1204 09:53:23.896000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5368763Z W1204 09:53:23.896000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5369215Z W1204 09:53:23.896000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5369662Z W1204 09:53:23.897000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5370111Z W1204 09:53:23.897000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5370560Z W1204 09:53:23.898000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5371012Z W1204 09:53:23.898000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5383977Z W1204 09:53:23.898000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5384503Z W1204 09:53:23.899000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5385153Z W1204 09:53:23.899000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5385618Z W1204 09:53:23.899000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5386075Z W1204 09:53:23.900000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5386530Z W1204 09:53:23.900000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5386983Z W1204 09:53:23.900000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5387439Z W1204 09:53:23.901000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5387918Z W1204 09:53:23.901000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5388381Z W1204 09:53:23.901000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5388840Z W1204 09:53:23.902000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5389291Z W1204 09:53:23.902000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5389741Z W1204 09:53:23.902000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5390194Z W1204 09:53:23.903000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5390655Z W1204 09:53:23.903000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5391107Z W1204 09:53:23.903000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5391559Z W1204 09:53:23.904000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5392016Z W1204 09:53:23.904000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5392467Z W1204 09:53:23.905000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5392922Z W1204 09:53:23.905000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5393371Z W1204 09:53:23.905000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5393824Z W1204 09:53:23.906000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5394366Z W1204 09:53:23.906000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5394820Z W1204 09:53:23.906000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5395287Z W1204 09:53:23.907000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5395739Z W1204 09:53:23.907000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5396188Z W1204 09:53:23.907000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5396647Z W1204 09:53:23.908000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5397100Z W1204 09:53:23.908000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5397561Z W1204 09:53:23.908000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5398013Z W1204 09:53:23.909000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5398465Z W1204 09:53:23.909000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5399006Z W1204 09:53:23.909000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5399461Z W1204 09:53:23.910000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5399915Z W1204 09:53:23.910000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5400366Z W1204 09:53:23.911000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5400818Z W1204 09:53:23.911000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5401289Z W1204 09:53:23.911000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5401743Z W1204 09:53:23.912000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5402204Z W1204 09:53:23.912000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5402657Z W1204 09:53:23.912000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5403107Z W1204 09:53:23.913000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5403560Z W1204 09:53:23.913000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5404016Z W1204 09:53:23.913000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5404478Z W1204 09:53:23.914000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5404936Z W1204 09:53:23.914000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5405398Z W1204 09:53:23.914000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5405866Z W1204 09:53:23.915000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5406375Z W1204 09:53:23.915000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5406831Z W1204 09:53:23.915000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5407290Z W1204 09:53:23.916000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5407745Z W1204 09:53:23.916000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5408313Z W1204 09:53:23.916000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5408785Z W1204 09:53:23.917000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5409266Z W1204 09:53:23.917000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5409730Z W1204 09:53:23.918000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5410194Z W1204 09:53:23.918000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5410666Z W1204 09:53:23.918000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5411139Z W1204 09:53:23.919000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5411614Z W1204 09:53:23.919000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5412074Z W1204 09:53:23.919000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5412540Z W1204 09:53:23.920000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5413103Z W1204 09:53:23.920000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5413560Z W1204 09:53:23.920000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5414017Z W1204 09:53:23.921000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5414477Z W1204 09:53:23.921000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5414937Z W1204 09:53:23.921000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5415586Z W1204 09:53:23.922000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5416042Z W1204 09:53:23.922000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5416509Z W1204 09:53:23.922000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5416967Z W1204 09:53:23.923000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5417429Z W1204 09:53:23.923000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5417890Z W1204 09:53:23.923000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5418352Z W1204 09:53:23.924000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5418810Z W1204 09:53:23.924000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5419270Z W1204 09:53:23.925000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5419723Z W1204 09:53:23.925000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5420181Z W1204 09:53:23.925000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5420631Z W1204 09:53:23.926000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5421083Z W1204 09:53:23.926000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5421532Z W1204 09:53:23.926000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5421982Z W1204 09:53:23.927000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5422530Z W1204 09:53:23.927000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5422983Z W1204 09:53:23.927000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5423437Z W1204 09:53:23.928000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5423886Z W1204 09:53:23.928000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5424332Z W1204 09:53:23.928000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5424780Z W1204 09:53:23.929000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5425239Z W1204 09:53:23.929000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5425737Z [W1204 09:53:25.439571160 collection.cpp:1148] Warning: ROCTracer produced duplicate flow start: 1 (function operator()) 2025-12-04T10:01:44.5426117Z ('RERUN', {'yellow': True}) [1.9423s] [ 26%] 2025-12-04T10:01:44.5426684Z inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda W1204 09:53:25.830000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5427454Z W1204 09:53:25.830000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5427912Z W1204 09:53:25.831000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5428364Z W1204 09:53:25.831000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5428815Z W1204 09:53:25.832000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5429270Z W1204 09:53:25.832000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5429720Z W1204 09:53:25.832000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5430173Z W1204 09:53:25.833000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5430637Z W1204 09:53:25.833000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5431087Z W1204 09:53:25.833000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5431538Z W1204 09:53:25.834000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5431987Z W1204 09:53:25.834000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5432439Z W1204 09:53:25.834000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5432894Z W1204 09:53:25.835000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5433345Z W1204 09:53:25.835000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5433806Z W1204 09:53:25.835000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5434258Z W1204 09:53:25.836000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5434719Z W1204 09:53:25.836000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5435177Z W1204 09:53:25.836000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5435629Z W1204 09:53:25.837000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5436186Z W1204 09:53:25.837000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5436638Z W1204 09:53:25.838000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5437093Z W1204 09:53:25.838000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5437548Z W1204 09:53:25.838000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5437995Z W1204 09:53:25.839000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5438442Z W1204 09:53:25.839000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5438892Z W1204 09:53:25.839000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5439346Z W1204 09:53:25.840000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5439798Z W1204 09:53:25.840000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5440250Z W1204 09:53:25.840000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5440790Z W1204 09:53:25.841000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5441240Z W1204 09:53:25.841000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5441693Z W1204 09:53:25.841000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5442141Z W1204 09:53:25.842000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5442590Z W1204 09:53:25.842000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5443048Z W1204 09:53:25.842000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5443497Z W1204 09:53:25.843000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5443948Z W1204 09:53:25.843000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5444404Z W1204 09:53:25.843000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5444851Z W1204 09:53:25.844000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5445297Z W1204 09:53:25.844000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5445747Z W1204 09:53:25.845000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5446242Z W1204 09:53:25.845000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5446697Z W1204 09:53:25.845000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5447152Z W1204 09:53:25.846000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5447606Z W1204 09:53:25.846000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5448060Z W1204 09:53:25.846000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5448510Z W1204 09:53:25.847000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5448960Z W1204 09:53:25.847000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5449412Z W1204 09:53:25.847000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5449979Z W1204 09:53:25.848000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5450432Z W1204 09:53:25.848000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5450879Z W1204 09:53:25.848000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5451336Z W1204 09:53:25.849000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5451788Z W1204 09:53:25.849000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5452237Z W1204 09:53:25.849000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5452690Z W1204 09:53:25.850000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5453143Z W1204 09:53:25.850000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5453601Z W1204 09:53:25.850000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5454054Z W1204 09:53:25.851000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5454610Z W1204 09:53:25.851000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5455058Z W1204 09:53:25.852000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5455507Z W1204 09:53:25.852000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5455958Z W1204 09:53:25.852000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5456406Z W1204 09:53:25.853000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5456854Z W1204 09:53:25.853000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5457302Z W1204 09:53:25.853000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5457750Z W1204 09:53:25.854000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5458202Z W1204 09:53:25.854000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5458650Z W1204 09:53:25.854000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5459101Z W1204 09:53:25.855000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5459548Z W1204 09:53:25.855000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5459996Z W1204 09:53:25.855000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5460452Z W1204 09:53:25.856000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5460901Z W1204 09:53:25.856000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5461361Z W1204 09:53:25.856000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5461811Z W1204 09:53:25.857000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5462260Z W1204 09:53:25.857000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5462703Z W1204 09:53:25.857000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5463148Z W1204 09:53:25.858000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5463688Z W1204 09:53:25.858000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5464137Z W1204 09:53:25.858000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5464584Z W1204 09:53:25.859000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5465030Z W1204 09:53:25.859000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5465479Z W1204 09:53:25.859000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5465924Z W1204 09:53:25.860000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5466368Z W1204 09:53:25.860000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5466810Z W1204 09:53:25.861000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5467260Z W1204 09:53:25.861000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5467704Z W1204 09:53:25.861000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5468320Z W1204 09:53:25.862000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5468763Z W1204 09:53:25.862000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5469209Z W1204 09:53:25.862000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5469656Z W1204 09:53:25.863000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5470098Z W1204 09:53:25.863000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5470548Z W1204 09:53:25.863000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5470995Z W1204 09:53:25.864000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5471439Z W1204 09:53:25.864000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5471889Z W1204 09:53:25.864000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5472333Z W1204 09:53:25.865000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5472662Z ('RERUN', {'yellow': True}) [1.9243s] [ 26%] 2025-12-04T10:01:44.5473206Z inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda W1204 09:53:27.756000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5473870Z W1204 09:53:27.757000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5474319Z W1204 09:53:27.757000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5474767Z W1204 09:53:27.757000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5475214Z W1204 09:53:27.758000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5475660Z W1204 09:53:27.758000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5476112Z W1204 09:53:27.759000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5476563Z W1204 09:53:27.759000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5477015Z W1204 09:53:27.759000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5477558Z W1204 09:53:27.760000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5478014Z W1204 09:53:27.760000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5478468Z W1204 09:53:27.760000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5478919Z W1204 09:53:27.761000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5479370Z W1204 09:53:27.761000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5479817Z W1204 09:53:27.761000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5480268Z W1204 09:53:27.762000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5480726Z W1204 09:53:27.762000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5481187Z W1204 09:53:27.762000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5481638Z W1204 09:53:27.763000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5482196Z W1204 09:53:27.763000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5482650Z W1204 09:53:27.763000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5483103Z W1204 09:53:27.764000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5483554Z W1204 09:53:27.764000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5484005Z W1204 09:53:27.764000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5484467Z W1204 09:53:27.765000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5484916Z W1204 09:53:27.765000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5485372Z W1204 09:53:27.766000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5485822Z W1204 09:53:27.766000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5486312Z W1204 09:53:27.766000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5486762Z W1204 09:53:27.767000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5487209Z W1204 09:53:27.767000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5487664Z W1204 09:53:27.767000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5488116Z W1204 09:53:27.768000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5488565Z W1204 09:53:27.768000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5489023Z W1204 09:53:27.768000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5489473Z W1204 09:53:27.769000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5489924Z W1204 09:53:27.769000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5490374Z W1204 09:53:27.769000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5490824Z W1204 09:53:27.770000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5491388Z W1204 09:53:27.770000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5491843Z W1204 09:53:27.770000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5492296Z W1204 09:53:27.771000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5492747Z W1204 09:53:27.771000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5493193Z W1204 09:53:27.771000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5493641Z W1204 09:53:27.772000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5494097Z W1204 09:53:27.772000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5494552Z W1204 09:53:27.772000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5495004Z W1204 09:53:27.773000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5495457Z W1204 09:53:27.773000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5496017Z W1204 09:53:27.773000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5496467Z W1204 09:53:27.774000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5496916Z W1204 09:53:27.774000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5497369Z W1204 09:53:27.775000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5497820Z W1204 09:53:27.775000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5498276Z W1204 09:53:27.775000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5498728Z W1204 09:53:27.776000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5499188Z W1204 09:53:27.776000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5499639Z W1204 09:53:27.776000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5500092Z W1204 09:53:27.777000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5500552Z W1204 09:53:27.777000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5501004Z W1204 09:53:27.777000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5501467Z W1204 09:53:27.778000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5501919Z W1204 09:53:27.778000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5502370Z W1204 09:53:27.778000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5502830Z W1204 09:53:27.779000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5503284Z W1204 09:53:27.779000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5503736Z W1204 09:53:27.779000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5504186Z W1204 09:53:27.780000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5504637Z W1204 09:53:27.780000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5505187Z W1204 09:53:27.780000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5505644Z W1204 09:53:27.781000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5506098Z W1204 09:53:27.781000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5506548Z W1204 09:53:27.782000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5506999Z W1204 09:53:27.782000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5507449Z W1204 09:53:27.782000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5507896Z W1204 09:53:27.783000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5508350Z W1204 09:53:27.783000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5508800Z W1204 09:53:27.783000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5509249Z W1204 09:53:27.784000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5509786Z W1204 09:53:27.784000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5510240Z W1204 09:53:27.784000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5510695Z W1204 09:53:27.785000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5511143Z W1204 09:53:27.785000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5511596Z W1204 09:53:27.785000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5512047Z W1204 09:53:27.786000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5512499Z W1204 09:53:27.786000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5512957Z W1204 09:53:27.786000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5513407Z W1204 09:53:27.787000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5513860Z W1204 09:53:27.787000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5514313Z W1204 09:53:27.788000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5514782Z W1204 09:53:27.788000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5515243Z W1204 09:53:27.788000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5515696Z W1204 09:53:27.789000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5516148Z W1204 09:53:27.789000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5516603Z W1204 09:53:27.789000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5517054Z W1204 09:53:27.790000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5517506Z W1204 09:53:27.790000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5517960Z W1204 09:53:27.790000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5518502Z W1204 09:53:27.791000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5518957Z W1204 09:53:27.791000 100822 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5519280Z FAILED [2.2654s] [ 26%] 2025-12-04T10:01:44.5519387Z 2025-12-04T10:01:44.5519478Z ==================================== RERUNS ==================================== 2025-12-04T10:01:44.5519779Z _________ GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda __________ 2025-12-04T10:01:44.5520064Z Traceback (most recent call last): 2025-12-04T10:01:44.5520458Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14468, in test_copy_non_blocking_is_pinned 2025-12-04T10:01:44.5520871Z self.assertEqual(outputs, outputs_compiled) 2025-12-04T10:01:44.5521266Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T10:01:44.5521657Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T10:01:44.5522091Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T10:01:44.5522548Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T10:01:44.5522810Z AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5523048Z 2025-12-04T10:01:44.5523121Z Mismatched elements: 24968 / 100000 (25.0%) 2025-12-04T10:01:44.5523410Z Greatest absolute difference: 0.9912109375 at index (97575,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5523771Z Greatest relative difference: inf at index (77101,) (up to 0.001 allowed) 2025-12-04T10:01:44.5523976Z 2025-12-04T10:01:44.5524090Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5524544Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5524879Z 2025-12-04T10:01:44.5525024Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5525355Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5525607Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5525810Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5526087Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5526414Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5526639Z graph_break [] 2025-12-04T10:01:44.5526842Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:01:44.5527593Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/profiler/profiler.py:217: UserWarning: Warning: Profiler clears events at the end of each cycle.Only events from the current cycle will be reported.To keep events across cycles, set acc_events=True. 2025-12-04T10:01:44.5528278Z _warn_once( 2025-12-04T10:01:44.5528496Z _________ GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda __________ 2025-12-04T10:01:44.5528772Z Traceback (most recent call last): 2025-12-04T10:01:44.5529160Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14468, in test_copy_non_blocking_is_pinned 2025-12-04T10:01:44.5529569Z self.assertEqual(outputs, outputs_compiled) 2025-12-04T10:01:44.5529949Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T10:01:44.5530332Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T10:01:44.5530756Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T10:01:44.5531202Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T10:01:44.5531453Z AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5531599Z 2025-12-04T10:01:44.5531676Z Mismatched elements: 82920 / 100000 (82.9%) 2025-12-04T10:01:44.5532080Z Greatest absolute difference: 0.9990234375 at index (56803,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5532443Z Greatest relative difference: inf at index (12573,) (up to 0.001 allowed) 2025-12-04T10:01:44.5532645Z 2025-12-04T10:01:44.5532764Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5533214Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5533545Z 2025-12-04T10:01:44.5533688Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5534010Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5534254Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5534459Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5534734Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5535022Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5535262Z graph_break [] 2025-12-04T10:01:44.5535462Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:01:44.5536206Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/profiler/profiler.py:217: UserWarning: Warning: Profiler clears events at the end of each cycle.Only events from the current cycle will be reported.To keep events across cycles, set acc_events=True. 2025-12-04T10:01:44.5537005Z _warn_once( 2025-12-04T10:01:44.5537199Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5537440Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5537643Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5537912Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5538193Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5538412Z graph_break [] 2025-12-04T10:01:44.5538571Z =================================== FAILURES =================================== 2025-12-04T10:01:44.5538859Z _________ GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda __________ 2025-12-04T10:01:44.5539133Z Traceback (most recent call last): 2025-12-04T10:01:44.5539514Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14468, in test_copy_non_blocking_is_pinned 2025-12-04T10:01:44.5539925Z self.assertEqual(outputs, outputs_compiled) 2025-12-04T10:01:44.5540300Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T10:01:44.5540680Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T10:01:44.5541104Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T10:01:44.5541548Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T10:01:44.5541799Z AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5541936Z 2025-12-04T10:01:44.5542014Z Mismatched elements: 77925 / 100000 (77.9%) 2025-12-04T10:01:44.5542300Z Greatest absolute difference: 0.99658203125 at index (63516,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5542656Z Greatest relative difference: inf at index (23195,) (up to 0.001 allowed) 2025-12-04T10:01:44.5542859Z 2025-12-04T10:01:44.5542977Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5543426Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5543756Z 2025-12-04T10:01:44.5543898Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5544217Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5544463Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5544665Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5545024Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5545320Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5545542Z graph_break [] 2025-12-04T10:01:44.5545735Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:01:44.5546491Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/profiler/profiler.py:217: UserWarning: Warning: Profiler clears events at the end of each cycle.Only events from the current cycle will be reported.To keep events across cycles, set acc_events=True. 2025-12-04T10:01:44.5547164Z _warn_once( 2025-12-04T10:01:44.5547357Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5547599Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5547798Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5548064Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5548343Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5548568Z graph_break [] 2025-12-04T10:01:44.5548756Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5548997Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5549195Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5549547Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5549823Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5550044Z graph_break [] 2025-12-04T10:01:44.5550507Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-610411f96626e4cf.xml - 2025-12-04T10:01:44.5551048Z =========================== short test summary info ============================ 2025-12-04T10:01:44.5551533Z FAILED [2.2654s] inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda - AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5551922Z 2025-12-04T10:01:44.5552003Z Mismatched elements: 77925 / 100000 (77.9%) 2025-12-04T10:01:44.5552289Z Greatest absolute difference: 0.99658203125 at index (63516,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5552647Z Greatest relative difference: inf at index (23195,) (up to 0.001 allowed) 2025-12-04T10:01:44.5552853Z 2025-12-04T10:01:44.5552971Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5553427Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5553757Z 2025-12-04T10:01:44.5553901Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5554199Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:01:44.5554480Z ======== 1 failed, 127 passed, 9 skipped, 2 rerun in 106.36s (0:01:46) ========= 2025-12-04T10:01:44.5554719Z Got exit code 1 2025-12-04T10:01:44.5554866Z Retrying single test... 2025-12-04T10:01:44.5555271Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-6181e92a6661fb27.xml 2025-12-04T10:01:44.5555719Z ============================= test session starts ============================== 2025-12-04T10:01:44.5556062Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:01:44.5556368Z cachedir: .pytest_cache 2025-12-04T10:01:44.5556723Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:01:44.5557118Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:01:44.5557300Z configfile: pytest.ini 2025-12-04T10:01:44.5557667Z plugins: hypothesis-6.56.4, xdoctest-1.3.0, subtests-0.13.1, rerunfailures-14.0, xdist-3.3.1, flakefinder-1.1.0, cpp-2.3.0, typeguard-4.3.0 2025-12-04T10:01:44.5558121Z collecting ... collected 999 items / 523 deselected / 476 selected 2025-12-04T10:01:44.5558756Z stepcurrent: skipping 136 already run items. Running only test/inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5559218Z Running 1 items in this shard 2025-12-04T10:01:44.5559333Z 2025-12-04T10:01:44.5559749Z inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda W1204 09:53:37.148000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5560426Z W1204 09:53:37.148000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5560894Z W1204 09:53:37.149000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5561349Z W1204 09:53:37.149000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5561805Z W1204 09:53:37.149000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5562256Z W1204 09:53:37.150000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5562708Z W1204 09:53:37.150000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5563245Z W1204 09:53:37.150000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5563697Z W1204 09:53:37.151000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5564147Z W1204 09:53:37.151000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5564597Z W1204 09:53:37.151000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5565048Z W1204 09:53:37.152000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5565505Z W1204 09:53:37.152000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5565957Z W1204 09:53:37.152000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5566455Z W1204 09:53:37.153000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5566905Z W1204 09:53:37.153000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5567359Z W1204 09:53:37.153000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5567809Z W1204 09:53:37.154000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5568259Z W1204 09:53:37.154000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5568713Z W1204 09:53:37.155000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5569166Z W1204 09:53:37.155000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5569617Z W1204 09:53:37.155000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5570091Z W1204 09:53:37.156000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5570552Z W1204 09:53:37.156000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5571008Z W1204 09:53:37.156000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5571460Z W1204 09:53:37.157000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5571910Z W1204 09:53:37.157000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5572473Z W1204 09:53:37.157000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5572923Z W1204 09:53:37.158000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5573375Z W1204 09:53:37.158000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5573823Z W1204 09:53:37.158000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5574276Z W1204 09:53:37.159000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5574756Z W1204 09:53:37.159000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5575340Z W1204 09:53:37.159000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5575799Z W1204 09:53:37.160000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5576254Z W1204 09:53:37.160000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5576705Z W1204 09:53:37.160000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5577275Z W1204 09:53:37.161000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5577722Z W1204 09:53:37.161000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5578175Z W1204 09:53:37.161000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5578625Z W1204 09:53:37.162000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5579074Z W1204 09:53:37.162000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5579529Z W1204 09:53:37.163000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5579982Z W1204 09:53:37.163000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5580436Z W1204 09:53:37.163000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5580887Z W1204 09:53:37.164000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5581339Z W1204 09:53:37.164000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5581792Z W1204 09:53:37.164000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5582245Z W1204 09:53:37.165000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5582698Z W1204 09:53:37.165000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5583149Z W1204 09:53:37.165000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5583601Z W1204 09:53:37.166000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5584058Z W1204 09:53:37.166000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5584510Z W1204 09:53:37.167000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5584964Z W1204 09:53:37.167000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5585428Z W1204 09:53:37.167000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5585882Z W1204 09:53:37.168000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5586445Z W1204 09:53:37.168000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5586900Z W1204 09:53:37.168000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5587353Z W1204 09:53:37.169000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5587804Z W1204 09:53:37.169000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5588252Z W1204 09:53:37.169000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5588704Z W1204 09:53:37.170000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5589154Z W1204 09:53:37.170000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5589607Z W1204 09:53:37.170000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5590061Z W1204 09:53:37.171000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5590513Z W1204 09:53:37.171000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5591058Z W1204 09:53:37.171000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5591512Z W1204 09:53:37.172000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5591962Z W1204 09:53:37.172000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5592412Z W1204 09:53:37.172000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5592865Z W1204 09:53:37.173000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5593320Z W1204 09:53:37.173000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5593772Z W1204 09:53:37.173000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5594227Z W1204 09:53:37.174000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5594680Z W1204 09:53:37.174000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5595134Z W1204 09:53:37.175000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5595584Z W1204 09:53:37.175000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5596039Z W1204 09:53:37.175000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5596493Z W1204 09:53:37.176000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5596949Z W1204 09:53:37.176000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5597397Z W1204 09:53:37.176000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5597859Z W1204 09:53:37.177000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5598310Z W1204 09:53:37.177000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5598762Z W1204 09:53:37.177000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5599215Z W1204 09:53:37.178000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5599664Z W1204 09:53:37.178000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5600206Z W1204 09:53:37.178000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5600661Z W1204 09:53:37.179000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5601118Z W1204 09:53:37.179000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5601567Z W1204 09:53:37.179000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5602018Z W1204 09:53:37.180000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5602468Z W1204 09:53:37.180000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5602920Z W1204 09:53:37.180000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5603374Z W1204 09:53:37.181000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5603827Z W1204 09:53:37.181000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5604283Z W1204 09:53:37.182000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5604823Z W1204 09:53:37.182000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5605276Z W1204 09:53:37.182000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5605729Z W1204 09:53:37.183000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5606272Z [W1204 09:53:38.118782523 collection.cpp:1148] Warning: ROCTracer produced duplicate flow start: 1 (function operator()) 2025-12-04T10:01:44.5606649Z ('RERUN', {'yellow': True}) [3.2937s] [100%] 2025-12-04T10:01:44.5607201Z inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda W1204 09:53:39.519000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5607865Z W1204 09:53:39.519000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5608330Z W1204 09:53:39.520000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5608785Z W1204 09:53:39.520000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5609236Z W1204 09:53:39.521000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5609688Z W1204 09:53:39.521000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5610141Z W1204 09:53:39.521000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5610596Z W1204 09:53:39.522000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5611051Z W1204 09:53:39.522000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5611514Z W1204 09:53:39.522000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5611965Z W1204 09:53:39.523000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5612414Z W1204 09:53:39.523000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5612871Z W1204 09:53:39.523000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5613320Z W1204 09:53:39.524000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5613884Z W1204 09:53:39.524000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5614349Z W1204 09:53:39.525000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5614808Z W1204 09:53:39.525000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5615264Z W1204 09:53:39.525000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5615720Z W1204 09:53:39.526000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5616173Z W1204 09:53:39.526000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5616629Z W1204 09:53:39.526000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5617090Z W1204 09:53:39.527000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5617548Z W1204 09:53:39.527000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5618004Z W1204 09:53:39.527000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5618565Z W1204 09:53:39.528000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5619020Z W1204 09:53:39.528000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5619472Z W1204 09:53:39.528000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5619928Z W1204 09:53:39.529000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5620380Z W1204 09:53:39.529000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5620835Z W1204 09:53:39.529000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5621292Z W1204 09:53:39.530000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5621747Z W1204 09:53:39.530000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5622204Z W1204 09:53:39.530000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5622660Z W1204 09:53:39.531000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5623112Z W1204 09:53:39.531000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5623564Z W1204 09:53:39.532000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5624021Z W1204 09:53:39.532000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5624482Z W1204 09:53:39.532000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5624938Z W1204 09:53:39.533000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5625401Z W1204 09:53:39.533000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5625856Z W1204 09:53:39.533000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5626313Z W1204 09:53:39.534000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5626772Z W1204 09:53:39.534000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5627233Z W1204 09:53:39.534000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5627786Z W1204 09:53:39.535000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5628246Z W1204 09:53:39.535000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5628695Z W1204 09:53:39.535000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5629149Z W1204 09:53:39.536000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5629600Z W1204 09:53:39.536000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5630052Z W1204 09:53:39.536000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5630502Z W1204 09:53:39.537000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5630958Z W1204 09:53:39.537000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5631414Z W1204 09:53:39.538000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5631864Z W1204 09:53:39.538000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5632409Z W1204 09:53:39.538000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5632864Z W1204 09:53:39.539000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5633315Z W1204 09:53:39.539000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5633767Z W1204 09:53:39.539000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5634218Z W1204 09:53:39.540000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5634675Z W1204 09:53:39.540000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5635128Z W1204 09:53:39.540000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5635588Z W1204 09:53:39.541000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5636048Z W1204 09:53:39.541000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5636502Z W1204 09:53:39.541000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5636956Z W1204 09:53:39.542000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5637408Z W1204 09:53:39.542000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5637869Z W1204 09:53:39.542000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5638326Z W1204 09:53:39.543000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5638778Z W1204 09:53:39.543000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5639239Z W1204 09:53:39.543000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5639690Z W1204 09:53:39.544000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5640146Z W1204 09:53:39.544000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5640598Z W1204 09:53:39.544000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5641052Z W1204 09:53:39.545000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5641615Z W1204 09:53:39.545000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5642074Z W1204 09:53:39.546000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5642532Z W1204 09:53:39.546000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5642987Z W1204 09:53:39.546000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5643441Z W1204 09:53:39.547000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5643900Z W1204 09:53:39.547000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5644353Z W1204 09:53:39.547000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5644810Z W1204 09:53:39.548000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5645269Z W1204 09:53:39.548000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5645732Z W1204 09:53:39.548000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5646326Z W1204 09:53:39.549000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5646781Z W1204 09:53:39.549000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5647236Z W1204 09:53:39.549000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5647691Z W1204 09:53:39.550000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5648149Z W1204 09:53:39.550000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5648604Z W1204 09:53:39.550000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5649058Z W1204 09:53:39.551000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5649509Z W1204 09:53:39.551000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5649969Z W1204 09:53:39.551000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5650418Z W1204 09:53:39.552000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5650870Z W1204 09:53:39.552000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5651320Z W1204 09:53:39.552000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5651773Z W1204 09:53:39.553000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5652228Z W1204 09:53:39.553000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5652681Z W1204 09:53:39.554000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5653140Z W1204 09:53:39.554000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5653475Z ('RERUN', {'yellow': True}) [1.9272s] [100%] 2025-12-04T10:01:44.5654035Z inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda W1204 09:53:41.447000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5654711Z W1204 09:53:41.448000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5655170Z W1204 09:53:41.448000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5655738Z W1204 09:53:41.448000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5656215Z W1204 09:53:41.449000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5656686Z W1204 09:53:41.449000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5657153Z W1204 09:53:41.450000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5657620Z W1204 09:53:41.450000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5658078Z W1204 09:53:41.450000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5658545Z W1204 09:53:41.451000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5659000Z W1204 09:53:41.451000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5659462Z W1204 09:53:41.451000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5659926Z W1204 09:53:41.452000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5660497Z W1204 09:53:41.452000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5660957Z W1204 09:53:41.452000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5661415Z W1204 09:53:41.453000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5661873Z W1204 09:53:41.453000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5662321Z W1204 09:53:41.453000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5662784Z W1204 09:53:41.454000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5663246Z W1204 09:53:41.454000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5663722Z W1204 09:53:41.454000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5664196Z W1204 09:53:41.455000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5664672Z W1204 09:53:41.455000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5665142Z W1204 09:53:41.456000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5665620Z W1204 09:53:41.456000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5666099Z W1204 09:53:41.456000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5666573Z W1204 09:53:41.457000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5667051Z W1204 09:53:41.457000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5667531Z W1204 09:53:41.457000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5668009Z W1204 09:53:41.458000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5668465Z W1204 09:53:41.458000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5668917Z W1204 09:53:41.458000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5669366Z W1204 09:53:41.459000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5669912Z W1204 09:53:41.459000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5670368Z W1204 09:53:41.459000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5670822Z W1204 09:53:41.460000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5671274Z W1204 09:53:41.460000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5671740Z W1204 09:53:41.460000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5672197Z W1204 09:53:41.461000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5672651Z W1204 09:53:41.461000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5673111Z W1204 09:53:41.461000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5673564Z W1204 09:53:41.462000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5674016Z W1204 09:53:41.462000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5674557Z W1204 09:53:41.462000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5675007Z W1204 09:53:41.463000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5675458Z W1204 09:53:41.463000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5675914Z W1204 09:53:41.464000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5676372Z W1204 09:53:41.464000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5676831Z W1204 09:53:41.464000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5677286Z W1204 09:53:41.465000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5677747Z W1204 09:53:41.465000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5678198Z W1204 09:53:41.465000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5678652Z W1204 09:53:41.466000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5679114Z W1204 09:53:41.466000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5679572Z W1204 09:53:41.466000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5680030Z W1204 09:53:41.467000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5680488Z W1204 09:53:41.467000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5680941Z W1204 09:53:41.467000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5681399Z W1204 09:53:41.468000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5681851Z W1204 09:53:41.468000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5682312Z W1204 09:53:41.468000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5682769Z W1204 09:53:41.469000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5691542Z W1204 09:53:41.469000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5692066Z W1204 09:53:41.469000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5692537Z W1204 09:53:41.470000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5692998Z W1204 09:53:41.470000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5693450Z W1204 09:53:41.470000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5693899Z W1204 09:53:41.471000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5694348Z W1204 09:53:41.471000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5694794Z W1204 09:53:41.472000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5695246Z W1204 09:53:41.472000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5695699Z W1204 09:53:41.472000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5696149Z W1204 09:53:41.473000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5696729Z W1204 09:53:41.473000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5697177Z W1204 09:53:41.473000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5697629Z W1204 09:53:41.474000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5698078Z W1204 09:53:41.474000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5698532Z W1204 09:53:41.474000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5698981Z W1204 09:53:41.475000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5699435Z W1204 09:53:41.475000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5699889Z W1204 09:53:41.475000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5700343Z W1204 09:53:41.476000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5700538Z W1204 09:53:41.476000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5700730Z W1204 09:53:41.476000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5700925Z W1204 09:53:41.477000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5701117Z W1204 09:53:41.477000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5701314Z W1204 09:53:41.477000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5701509Z W1204 09:53:41.478000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5701704Z W1204 09:53:41.478000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5701894Z W1204 09:53:41.478000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5702090Z W1204 09:53:41.479000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5702278Z W1204 09:53:41.479000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5702567Z W1204 09:53:41.479000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5702760Z W1204 09:53:41.480000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5702957Z W1204 09:53:41.480000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5703158Z W1204 09:53:41.481000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5703346Z W1204 09:53:41.481000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5703546Z W1204 09:53:41.481000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5703738Z W1204 09:53:41.482000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5703932Z W1204 09:53:41.482000 111907 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5703998Z FAILED [2.1010s] [100%] 2025-12-04T10:01:44.5704003Z 2025-12-04T10:01:44.5704100Z ==================================== RERUNS ==================================== 2025-12-04T10:01:44.5704243Z _________ GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda __________ 2025-12-04T10:01:44.5704408Z Traceback (most recent call last): 2025-12-04T10:01:44.5704679Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14468, in test_copy_non_blocking_is_pinned 2025-12-04T10:01:44.5704774Z self.assertEqual(outputs, outputs_compiled) 2025-12-04T10:01:44.5705016Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T10:01:44.5705116Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T10:01:44.5705399Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T10:01:44.5705525Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T10:01:44.5705611Z AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5705615Z 2025-12-04T10:01:44.5705693Z Mismatched elements: 31314 / 100000 (31.3%) 2025-12-04T10:01:44.5705851Z Greatest absolute difference: 0.9912109375 at index (97575,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5705999Z Greatest relative difference: inf at index (70048,) (up to 0.001 allowed) 2025-12-04T10:01:44.5706002Z 2025-12-04T10:01:44.5706123Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5706399Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5706402Z 2025-12-04T10:01:44.5706548Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5706669Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5706740Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5706836Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5706949Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5707064Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5707132Z graph_break [] 2025-12-04T10:01:44.5707246Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:01:44.5707822Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/profiler/profiler.py:217: UserWarning: Warning: Profiler clears events at the end of each cycle.Only events from the current cycle will be reported.To keep events across cycles, set acc_events=True. 2025-12-04T10:01:44.5707877Z _warn_once( 2025-12-04T10:01:44.5708022Z _________ GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda __________ 2025-12-04T10:01:44.5708092Z Traceback (most recent call last): 2025-12-04T10:01:44.5708458Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14468, in test_copy_non_blocking_is_pinned 2025-12-04T10:01:44.5708549Z self.assertEqual(outputs, outputs_compiled) 2025-12-04T10:01:44.5708794Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T10:01:44.5708888Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T10:01:44.5709162Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T10:01:44.5709276Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T10:01:44.5709358Z AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5709361Z 2025-12-04T10:01:44.5709436Z Mismatched elements: 79921 / 100000 (79.9%) 2025-12-04T10:01:44.5709591Z Greatest absolute difference: 0.9951171875 at index (54644,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5709733Z Greatest relative difference: inf at index (24455,) (up to 0.001 allowed) 2025-12-04T10:01:44.5709737Z 2025-12-04T10:01:44.5709853Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5710134Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5710137Z 2025-12-04T10:01:44.5710361Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5710484Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5710550Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5710646Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5710750Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5710871Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5710926Z graph_break [] 2025-12-04T10:01:44.5711046Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:01:44.5711615Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/profiler/profiler.py:217: UserWarning: Warning: Profiler clears events at the end of each cycle.Only events from the current cycle will be reported.To keep events across cycles, set acc_events=True. 2025-12-04T10:01:44.5711678Z _warn_once( 2025-12-04T10:01:44.5711795Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5711869Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5711956Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5712074Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5712179Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5712241Z graph_break [] 2025-12-04T10:01:44.5712322Z =================================== FAILURES =================================== 2025-12-04T10:01:44.5712469Z _________ GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda __________ 2025-12-04T10:01:44.5712548Z Traceback (most recent call last): 2025-12-04T10:01:44.5712809Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14468, in test_copy_non_blocking_is_pinned 2025-12-04T10:01:44.5712902Z self.assertEqual(outputs, outputs_compiled) 2025-12-04T10:01:44.5713139Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T10:01:44.5713230Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T10:01:44.5713496Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T10:01:44.5713612Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T10:01:44.5713693Z AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5713696Z 2025-12-04T10:01:44.5713775Z Mismatched elements: 44951 / 100000 (45.0%) 2025-12-04T10:01:44.5713929Z Greatest absolute difference: 0.9970703125 at index (67001,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5714160Z Greatest relative difference: inf at index (55989,) (up to 0.001 allowed) 2025-12-04T10:01:44.5714164Z 2025-12-04T10:01:44.5714280Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5714558Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5714566Z 2025-12-04T10:01:44.5714703Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5714824Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5714885Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5714972Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5715079Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5715197Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5715252Z graph_break [] 2025-12-04T10:01:44.5715370Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:01:44.5715937Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/profiler/profiler.py:217: UserWarning: Warning: Profiler clears events at the end of each cycle.Only events from the current cycle will be reported.To keep events across cycles, set acc_events=True. 2025-12-04T10:01:44.5716076Z _warn_once( 2025-12-04T10:01:44.5716194Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5716258Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5716351Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5716464Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5716574Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5716632Z graph_break [] 2025-12-04T10:01:44.5716745Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5716809Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5716899Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5717006Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5717107Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5717172Z graph_break [] 2025-12-04T10:01:44.5717551Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-6181e92a6661fb27.xml - 2025-12-04T10:01:44.5717647Z =========================== short test summary info ============================ 2025-12-04T10:01:44.5717973Z FAILED [2.1010s] inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda - AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5717977Z 2025-12-04T10:01:44.5718050Z Mismatched elements: 44951 / 100000 (45.0%) 2025-12-04T10:01:44.5718205Z Greatest absolute difference: 0.9970703125 at index (67001,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5718344Z Greatest relative difference: inf at index (55989,) (up to 0.001 allowed) 2025-12-04T10:01:44.5718347Z 2025-12-04T10:01:44.5718458Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5718739Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5718742Z 2025-12-04T10:01:44.5718882Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5718976Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:01:44.5719087Z ================== 1 failed, 523 deselected, 2 rerun in 7.37s ================== 2025-12-04T10:01:44.5719141Z Got exit code 1 2025-12-04T10:01:44.5719206Z Retrying single test... 2025-12-04T10:01:44.5719500Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-62e64dadea44383e.xml 2025-12-04T10:01:44.5719689Z ============================= test session starts ============================== 2025-12-04T10:01:44.5719875Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:01:44.5719944Z cachedir: .pytest_cache 2025-12-04T10:01:44.5720207Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:01:44.5720279Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:01:44.5720339Z configfile: pytest.ini 2025-12-04T10:01:44.5720615Z plugins: hypothesis-6.56.4, xdoctest-1.3.0, subtests-0.13.1, rerunfailures-14.0, xdist-3.3.1, flakefinder-1.1.0, cpp-2.3.0, typeguard-4.3.0 2025-12-04T10:01:44.5720736Z collecting ... collected 999 items / 523 deselected / 476 selected 2025-12-04T10:01:44.5721081Z stepcurrent: skipping 136 already run items. Running only test/inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5721150Z Running 1 items in this shard 2025-12-04T10:01:44.5721153Z 2025-12-04T10:01:44.5721573Z inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda W1204 09:53:49.887000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5721882Z W1204 09:53:49.888000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5722078Z W1204 09:53:49.888000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5722272Z W1204 09:53:49.888000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5722461Z W1204 09:53:49.889000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5722657Z W1204 09:53:49.889000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5722851Z W1204 09:53:49.889000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5723047Z W1204 09:53:49.890000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5723239Z W1204 09:53:49.890000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5723433Z W1204 09:53:49.890000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5723622Z W1204 09:53:49.891000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5723815Z W1204 09:53:49.891000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5724002Z W1204 09:53:49.892000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5724201Z W1204 09:53:49.892000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5724391Z W1204 09:53:49.892000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5724584Z W1204 09:53:49.893000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5724782Z W1204 09:53:49.893000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5724972Z W1204 09:53:49.893000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5725171Z W1204 09:53:49.894000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5725361Z W1204 09:53:49.894000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5725554Z W1204 09:53:49.894000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5725833Z W1204 09:53:49.895000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5726029Z W1204 09:53:49.895000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5726436Z W1204 09:53:49.895000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5726634Z W1204 09:53:49.896000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5726830Z W1204 09:53:49.896000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5727028Z W1204 09:53:49.896000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5727217Z W1204 09:53:49.897000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5727417Z W1204 09:53:49.897000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5727607Z W1204 09:53:49.897000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5727808Z W1204 09:53:49.898000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5728132Z W1204 09:53:49.898000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5728322Z W1204 09:53:49.898000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5728514Z W1204 09:53:49.899000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5728703Z W1204 09:53:49.899000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5728897Z W1204 09:53:49.899000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5729090Z W1204 09:53:49.900000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5729284Z W1204 09:53:49.900000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5729481Z W1204 09:53:49.901000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5729675Z W1204 09:53:49.901000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5729866Z W1204 09:53:49.901000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5730065Z W1204 09:53:49.902000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5730256Z W1204 09:53:49.902000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5730452Z W1204 09:53:49.902000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5730647Z W1204 09:53:49.903000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5730836Z W1204 09:53:49.903000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5731034Z W1204 09:53:49.903000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5731226Z W1204 09:53:49.904000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5731423Z W1204 09:53:49.904000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5731618Z W1204 09:53:49.904000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5731812Z W1204 09:53:49.905000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5732108Z W1204 09:53:49.905000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5732310Z W1204 09:53:49.905000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5732507Z W1204 09:53:49.906000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5732702Z W1204 09:53:49.906000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5732894Z W1204 09:53:49.906000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5733088Z W1204 09:53:49.907000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5733278Z W1204 09:53:49.907000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5733476Z W1204 09:53:49.908000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5733672Z W1204 09:53:49.908000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5733859Z W1204 09:53:49.908000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5734137Z W1204 09:53:49.909000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5734327Z W1204 09:53:49.909000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5734522Z W1204 09:53:49.909000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5734710Z W1204 09:53:49.910000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5734907Z W1204 09:53:49.910000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5735101Z W1204 09:53:49.910000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5735295Z W1204 09:53:49.911000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5735488Z W1204 09:53:49.911000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5735684Z W1204 09:53:49.911000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5735874Z W1204 09:53:49.912000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5736067Z W1204 09:53:49.912000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5736255Z W1204 09:53:49.912000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5736450Z W1204 09:53:49.913000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5736645Z W1204 09:53:49.913000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5736835Z W1204 09:53:49.913000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5737031Z W1204 09:53:49.914000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5737220Z W1204 09:53:49.914000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5737416Z W1204 09:53:49.914000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5737607Z W1204 09:53:49.915000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5737802Z W1204 09:53:49.915000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5738079Z W1204 09:53:49.915000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5738277Z W1204 09:53:49.916000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5738470Z W1204 09:53:49.916000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5738666Z W1204 09:53:49.917000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5738856Z W1204 09:53:49.917000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5739051Z W1204 09:53:49.917000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5739241Z W1204 09:53:49.918000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5739443Z W1204 09:53:49.918000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5739637Z W1204 09:53:49.918000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5739826Z W1204 09:53:49.919000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5740104Z W1204 09:53:49.919000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5740294Z W1204 09:53:49.919000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5740492Z W1204 09:53:49.920000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5740682Z W1204 09:53:49.920000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5740875Z W1204 09:53:49.920000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5741069Z W1204 09:53:49.921000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5741261Z W1204 09:53:49.921000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5741456Z W1204 09:53:49.921000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5741653Z W1204 09:53:49.922000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5741892Z [W1204 09:53:51.839966637 collection.cpp:1148] Warning: ROCTracer produced duplicate flow start: 1 (function operator()) 2025-12-04T10:01:44.5741975Z ('RERUN', {'yellow': True}) [3.2693s] [100%] 2025-12-04T10:01:44.5742395Z inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda W1204 09:53:52.237000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5742593Z W1204 09:53:52.237000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5742786Z W1204 09:53:52.238000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5742983Z W1204 09:53:52.238000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5743179Z W1204 09:53:52.239000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5743373Z W1204 09:53:52.239000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5743581Z W1204 09:53:52.239000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5743776Z W1204 09:53:52.240000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5744073Z W1204 09:53:52.240000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5744267Z W1204 09:53:52.240000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5744465Z W1204 09:53:52.241000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5744661Z W1204 09:53:52.241000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5744857Z W1204 09:53:52.241000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5745046Z W1204 09:53:52.242000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5745243Z W1204 09:53:52.242000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5745438Z W1204 09:53:52.242000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5745633Z W1204 09:53:52.243000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5745829Z W1204 09:53:52.243000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5746109Z W1204 09:53:52.243000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5746305Z W1204 09:53:52.244000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5746495Z W1204 09:53:52.244000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5746690Z W1204 09:53:52.245000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5746882Z W1204 09:53:52.245000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5747079Z W1204 09:53:52.245000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5747268Z W1204 09:53:52.246000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5747465Z W1204 09:53:52.246000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5747656Z W1204 09:53:52.246000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5747849Z W1204 09:53:52.247000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5748039Z W1204 09:53:52.247000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5748232Z W1204 09:53:52.247000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5748425Z W1204 09:53:52.248000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5748616Z W1204 09:53:52.248000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5748815Z W1204 09:53:52.248000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5749005Z W1204 09:53:52.249000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5749200Z W1204 09:53:52.249000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5749391Z W1204 09:53:52.249000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5749585Z W1204 09:53:52.250000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5749776Z W1204 09:53:52.250000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5750056Z W1204 09:53:52.251000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5750251Z W1204 09:53:52.251000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5750446Z W1204 09:53:52.251000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5750641Z W1204 09:53:52.252000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5750837Z W1204 09:53:52.252000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5751032Z W1204 09:53:52.252000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5751221Z W1204 09:53:52.253000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5751415Z W1204 09:53:52.253000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5751608Z W1204 09:53:52.253000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5751805Z W1204 09:53:52.254000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5752079Z W1204 09:53:52.254000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5752274Z W1204 09:53:52.254000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5752462Z W1204 09:53:52.255000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5752655Z W1204 09:53:52.255000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5752851Z W1204 09:53:52.255000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5753043Z W1204 09:53:52.256000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5753235Z W1204 09:53:52.256000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5753428Z W1204 09:53:52.256000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5753626Z W1204 09:53:52.257000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5753818Z W1204 09:53:52.257000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5754013Z W1204 09:53:52.258000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5754202Z W1204 09:53:52.258000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5754398Z W1204 09:53:52.258000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5754584Z W1204 09:53:52.259000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5754778Z W1204 09:53:52.259000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5754972Z W1204 09:53:52.259000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5755159Z W1204 09:53:52.260000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5755350Z W1204 09:53:52.260000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5755539Z W1204 09:53:52.260000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5755730Z W1204 09:53:52.261000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5756006Z W1204 09:53:52.261000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5756210Z W1204 09:53:52.261000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5756393Z W1204 09:53:52.262000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5756578Z W1204 09:53:52.262000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5756756Z W1204 09:53:52.262000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5756937Z W1204 09:53:52.263000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5757115Z W1204 09:53:52.263000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5757301Z W1204 09:53:52.263000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5757479Z W1204 09:53:52.264000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5757661Z W1204 09:53:52.264000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5757920Z W1204 09:53:52.264000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5758097Z W1204 09:53:52.265000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5758276Z W1204 09:53:52.265000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5758454Z W1204 09:53:52.266000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5758639Z W1204 09:53:52.266000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5758820Z W1204 09:53:52.266000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5759000Z W1204 09:53:52.267000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5759180Z W1204 09:53:52.267000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5759362Z W1204 09:53:52.267000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5759541Z W1204 09:53:52.268000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5759725Z W1204 09:53:52.268000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5759904Z W1204 09:53:52.268000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5760090Z W1204 09:53:52.269000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5760271Z W1204 09:53:52.269000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5760453Z W1204 09:53:52.269000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5760641Z W1204 09:53:52.270000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5760819Z W1204 09:53:52.270000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5761001Z W1204 09:53:52.270000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5761181Z W1204 09:53:52.271000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5761364Z W1204 09:53:52.271000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5761635Z W1204 09:53:52.271000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5761822Z W1204 09:53:52.272000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5761895Z ('RERUN', {'yellow': True}) [1.9130s] [100%] 2025-12-04T10:01:44.5762281Z inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda W1204 09:53:54.147000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5762464Z W1204 09:53:54.147000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5762649Z W1204 09:53:54.148000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5762830Z W1204 09:53:54.148000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5763016Z W1204 09:53:54.148000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5763199Z W1204 09:53:54.149000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5763378Z W1204 09:53:54.149000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5763644Z W1204 09:53:54.149000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5763824Z W1204 09:53:54.150000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5764007Z W1204 09:53:54.150000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5764186Z W1204 09:53:54.151000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5764372Z W1204 09:53:54.151000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5764554Z W1204 09:53:54.151000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5764739Z W1204 09:53:54.152000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5764920Z W1204 09:53:54.152000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5765103Z W1204 09:53:54.152000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5765282Z W1204 09:53:54.153000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5765466Z W1204 09:53:54.153000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5765646Z W1204 09:53:54.153000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5765832Z W1204 09:53:54.154000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5766024Z W1204 09:53:54.154000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5766257Z W1204 09:53:54.154000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5766445Z W1204 09:53:54.155000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5766623Z W1204 09:53:54.155000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5766812Z W1204 09:53:54.155000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5766991Z W1204 09:53:54.156000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5767175Z W1204 09:53:54.156000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5767470Z W1204 09:53:54.156000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5767660Z W1204 09:53:54.157000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5767844Z W1204 09:53:54.157000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5768026Z W1204 09:53:54.158000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5768222Z W1204 09:53:54.158000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5768417Z W1204 09:53:54.158000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5768607Z W1204 09:53:54.159000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5768803Z W1204 09:53:54.159000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5768997Z W1204 09:53:54.159000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5769188Z W1204 09:53:54.160000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5769483Z W1204 09:53:54.160000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5769674Z W1204 09:53:54.160000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5769869Z W1204 09:53:54.161000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5770057Z W1204 09:53:54.161000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5770259Z W1204 09:53:54.161000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5770449Z W1204 09:53:54.162000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5770644Z W1204 09:53:54.162000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5770836Z W1204 09:53:54.162000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5771029Z W1204 09:53:54.163000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5771218Z W1204 09:53:54.163000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5771412Z W1204 09:53:54.163000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5771602Z W1204 09:53:54.164000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5771794Z W1204 09:53:54.164000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5771990Z W1204 09:53:54.164000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5772180Z W1204 09:53:54.165000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5772377Z W1204 09:53:54.165000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5772565Z W1204 09:53:54.166000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5772757Z W1204 09:53:54.166000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5772945Z W1204 09:53:54.166000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5773223Z W1204 09:53:54.167000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5773416Z W1204 09:53:54.167000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5773609Z W1204 09:53:54.167000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5773802Z W1204 09:53:54.168000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5773998Z W1204 09:53:54.168000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5774190Z W1204 09:53:54.168000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5774383Z W1204 09:53:54.169000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5774581Z W1204 09:53:54.169000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5774775Z W1204 09:53:54.169000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5774969Z W1204 09:53:54.170000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5775261Z W1204 09:53:54.170000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5775454Z W1204 09:53:54.170000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5775645Z W1204 09:53:54.171000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5775840Z W1204 09:53:54.171000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5776037Z W1204 09:53:54.171000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5776236Z W1204 09:53:54.172000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5776426Z W1204 09:53:54.172000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5776618Z W1204 09:53:54.172000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5776811Z W1204 09:53:54.173000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5777004Z W1204 09:53:54.173000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5777192Z W1204 09:53:54.173000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5777387Z W1204 09:53:54.174000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5777580Z W1204 09:53:54.174000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5777772Z W1204 09:53:54.174000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5777967Z W1204 09:53:54.175000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5778158Z W1204 09:53:54.175000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5778350Z W1204 09:53:54.175000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5778540Z W1204 09:53:54.176000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5778732Z W1204 09:53:54.176000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5778921Z W1204 09:53:54.177000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5779200Z W1204 09:53:54.177000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5779392Z W1204 09:53:54.177000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5779586Z W1204 09:53:54.178000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5779776Z W1204 09:53:54.178000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5779970Z W1204 09:53:54.178000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5780161Z W1204 09:53:54.179000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5780352Z W1204 09:53:54.179000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5780546Z W1204 09:53:54.179000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5780739Z W1204 09:53:54.180000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5780936Z W1204 09:53:54.180000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5781210Z W1204 09:53:54.180000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5781404Z W1204 09:53:54.181000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5781592Z W1204 09:53:54.181000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5781784Z W1204 09:53:54.181000 112773 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5781843Z FAILED [2.1016s] [100%] 2025-12-04T10:01:44.5781846Z 2025-12-04T10:01:44.5781934Z ==================================== RERUNS ==================================== 2025-12-04T10:01:44.5782074Z _________ GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda __________ 2025-12-04T10:01:44.5782148Z Traceback (most recent call last): 2025-12-04T10:01:44.5782412Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14468, in test_copy_non_blocking_is_pinned 2025-12-04T10:01:44.5782504Z self.assertEqual(outputs, outputs_compiled) 2025-12-04T10:01:44.5782739Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T10:01:44.5782834Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T10:01:44.5783100Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T10:01:44.5783216Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T10:01:44.5783302Z AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5783305Z 2025-12-04T10:01:44.5783373Z Mismatched elements: 24032 / 100000 (24.0%) 2025-12-04T10:01:44.5783536Z Greatest absolute difference: 0.9912109375 at index (97575,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5783672Z Greatest relative difference: inf at index (77101,) (up to 0.001 allowed) 2025-12-04T10:01:44.5783675Z 2025-12-04T10:01:44.5783793Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5784064Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5784067Z 2025-12-04T10:01:44.5784209Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5784324Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5784389Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5784479Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5784587Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5784791Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5784854Z graph_break [] 2025-12-04T10:01:44.5784964Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:01:44.5785533Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/profiler/profiler.py:217: UserWarning: Warning: Profiler clears events at the end of each cycle.Only events from the current cycle will be reported.To keep events across cycles, set acc_events=True. 2025-12-04T10:01:44.5785591Z _warn_once( 2025-12-04T10:01:44.5785732Z _________ GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda __________ 2025-12-04T10:01:44.5785799Z Traceback (most recent call last): 2025-12-04T10:01:44.5786070Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14468, in test_copy_non_blocking_is_pinned 2025-12-04T10:01:44.5786155Z self.assertEqual(outputs, outputs_compiled) 2025-12-04T10:01:44.5786379Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T10:01:44.5786464Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T10:01:44.5786713Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T10:01:44.5786910Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T10:01:44.5786984Z AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5786987Z 2025-12-04T10:01:44.5787053Z Mismatched elements: 80922 / 100000 (80.9%) 2025-12-04T10:01:44.5787200Z Greatest absolute difference: 0.99658203125 at index (63516,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5787328Z Greatest relative difference: inf at index (20195,) (up to 0.001 allowed) 2025-12-04T10:01:44.5787331Z 2025-12-04T10:01:44.5787433Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5787691Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5787694Z 2025-12-04T10:01:44.5787823Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5787929Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5787992Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5788079Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5788177Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5788289Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5788338Z graph_break [] 2025-12-04T10:01:44.5788447Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:01:44.5788981Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/profiler/profiler.py:217: UserWarning: Warning: Profiler clears events at the end of each cycle.Only events from the current cycle will be reported.To keep events across cycles, set acc_events=True. 2025-12-04T10:01:44.5789035Z _warn_once( 2025-12-04T10:01:44.5789140Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5789197Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5789286Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5789390Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5789489Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5789538Z graph_break [] 2025-12-04T10:01:44.5789613Z =================================== FAILURES =================================== 2025-12-04T10:01:44.5789740Z _________ GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda __________ 2025-12-04T10:01:44.5789807Z Traceback (most recent call last): 2025-12-04T10:01:44.5790046Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14468, in test_copy_non_blocking_is_pinned 2025-12-04T10:01:44.5790288Z self.assertEqual(outputs, outputs_compiled) 2025-12-04T10:01:44.5790505Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T10:01:44.5790588Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T10:01:44.5790840Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T10:01:44.5790945Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T10:01:44.5791016Z AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5791018Z 2025-12-04T10:01:44.5791086Z Mismatched elements: 88924 / 100000 (88.9%) 2025-12-04T10:01:44.5791230Z Greatest absolute difference: 0.99462890625 at index (28259,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5791358Z Greatest relative difference: inf at index (13280,) (up to 0.001 allowed) 2025-12-04T10:01:44.5791361Z 2025-12-04T10:01:44.5791463Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5791720Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5791722Z 2025-12-04T10:01:44.5791854Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5792036Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5792097Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5792178Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5792278Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5792382Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5792435Z graph_break [] 2025-12-04T10:01:44.5792536Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:01:44.5793072Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/profiler/profiler.py:217: UserWarning: Warning: Profiler clears events at the end of each cycle.Only events from the current cycle will be reported.To keep events across cycles, set acc_events=True. 2025-12-04T10:01:44.5793120Z _warn_once( 2025-12-04T10:01:44.5793225Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5793284Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5793364Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5793466Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5793564Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5793615Z graph_break [] 2025-12-04T10:01:44.5793721Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:01:44.5793776Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:01:44.5793861Z stats [('calls_captured', 402), ('unique_graphs', 1)] 2025-12-04T10:01:44.5793965Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:01:44.5794060Z inductor [('extern_calls', 100), ('fxgraph_cache_miss', 1)] 2025-12-04T10:01:44.5794112Z graph_break [] 2025-12-04T10:01:44.5794470Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-62e64dadea44383e.xml - 2025-12-04T10:01:44.5794560Z =========================== short test summary info ============================ 2025-12-04T10:01:44.5794870Z FAILED [2.1016s] inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda - AssertionError: Tensor-likes are not close! 2025-12-04T10:01:44.5794873Z 2025-12-04T10:01:44.5794939Z Mismatched elements: 88924 / 100000 (88.9%) 2025-12-04T10:01:44.5795083Z Greatest absolute difference: 0.99462890625 at index (28259,) (up to 1e-05 allowed) 2025-12-04T10:01:44.5795208Z Greatest relative difference: inf at index (13280,) (up to 0.001 allowed) 2025-12-04T10:01:44.5795213Z 2025-12-04T10:01:44.5795402Z To execute this test, run the following from the base repo dir: 2025-12-04T10:01:44.5795659Z PYTORCH_TEST_WITH_ROCM=1 python test/inductor/test_torchinductor.py GPUTests.test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5795662Z 2025-12-04T10:01:44.5795795Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:01:44.5795886Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:01:44.5795986Z ================== 1 failed, 523 deselected, 2 rerun in 7.34s ================== 2025-12-04T10:01:44.5796043Z Got exit code 1 2025-12-04T10:01:44.5796280Z FAILED CONSISTENTLY: test/inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda 2025-12-04T10:01:44.5796485Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T10:01:44.5796768Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-ea2ab4f276b6917a.xml 2025-12-04T10:01:44.5796855Z ============================= test session starts ============================== 2025-12-04T10:01:44.5797028Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:01:44.5797178Z cachedir: .pytest_cache 2025-12-04T10:01:44.5797425Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:01:44.5797495Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:01:44.5797551Z configfile: pytest.ini 2025-12-04T10:01:44.5797812Z plugins: hypothesis-6.56.4, xdoctest-1.3.0, subtests-0.13.1, rerunfailures-14.0, xdist-3.3.1, flakefinder-1.1.0, cpp-2.3.0, typeguard-4.3.0 2025-12-04T10:01:44.5797928Z collecting ... collected 999 items / 137 deselected / 862 selected 2025-12-04T10:01:44.5798007Z stepcurrent: skipping 137 already run items. 2025-12-04T10:01:44.5798068Z Running 387 items in this shard 2025-12-04T10:01:44.5798075Z 2025-12-04T10:01:44.5798226Z inductor/test_torchinductor.py::GPUTests::test_cos_cuda PASSED [1.7830s] [ 0%] 2025-12-04T10:01:44.5798594Z inductor/test_torchinductor.py::GPUTests::test_cpu_scalar_with_gpu_tensor_cuda W1204 09:54:02.873000 113639 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5798653Z PASSED [0.1085s] [ 0%] 2025-12-04T10:01:44.5799032Z inductor/test_torchinductor.py::GPUTests::test_cpu_scalar_with_gpu_tensor_dynamic_cuda W1204 09:54:02.997000 113639 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:01:44.5799085Z PASSED [0.1434s] [ 0%] 2025-12-04T10:01:44.5799282Z inductor/test_torchinductor.py::GPUTests::test_cpu_tensor_with_gpu_tensor_cuda PASSED [0.0103s] [ 1%] 2025-12-04T10:01:44.5799431Z inductor/test_torchinductor.py::GPUTests::test_cumsum_cuda PASSED [4.7579s] [ 1%] 2025-12-04T10:01:44.5799599Z inductor/test_torchinductor.py::GPUTests::test_cumsum_no_mask_cuda PASSED [0.7934s] [ 1%] 2025-12-04T10:01:44.5799765Z inductor/test_torchinductor.py::GPUTests::test_cumsum_zero_dim_cuda PASSED [0.6907s] [ 1%] 2025-12-04T10:01:44.5799923Z inductor/test_torchinductor.py::GPUTests::test_custom_op_1_cuda PASSED [0.5560s] [ 2%] 2025-12-04T10:01:44.5800080Z inductor/test_torchinductor.py::GPUTests::test_custom_op_2_cuda PASSED [0.3864s] [ 2%] 2025-12-04T10:01:44.5800296Z inductor/test_torchinductor.py::GPUTests::test_custom_op_default_layout_constraint_cuda PASSED [0.4377s] [ 2%] 2025-12-04T10:01:44.5800506Z inductor/test_torchinductor.py::GPUTests::test_custom_op_fixed_layout_channels_last_cuda PASSED [0.6959s] [ 2%] 2025-12-04T10:01:44.5800708Z inductor/test_torchinductor.py::GPUTests::test_custom_op_fixed_layout_sequential_cuda PASSED [0.7154s] [ 3%] 2025-12-04T10:01:44.5800894Z inductor/test_torchinductor.py::GPUTests::test_custom_scan_op_compiled_cuda PASSED [1.3652s] [ 3%] 2025-12-04T10:01:44.5801057Z inductor/test_torchinductor.py::GPUTests::test_custom_scan_op_cuda PASSED [0.2506s] [ 3%] 2025-12-04T10:01:44.5801334Z inductor/test_torchinductor.py::GPUTests::test_custom_scan_op_multi_input_cuda PASSED [0.0946s] [ 3%] 2025-12-04T10:01:44.5801517Z inductor/test_torchinductor.py::GPUTests::test_custom_scan_would_split_cuda PASSED [0.5752s] [ 4%] 2025-12-04T10:01:44.5801705Z inductor/test_torchinductor.py::GPUTests::test_deterministic_codegen_cuda PASSED [4.1031s] [ 4%] 2025-12-04T10:01:44.5801866Z inductor/test_torchinductor.py::GPUTests::test_diagonal_copy_cuda PASSED [0.5294s] [ 4%] 2025-12-04T10:01:44.5802020Z inductor/test_torchinductor.py::GPUTests::test_dist_bf16_cuda PASSED [0.2856s] [ 4%] 2025-12-04T10:01:44.5802164Z inductor/test_torchinductor.py::GPUTests::test_dist_cuda PASSED [0.3271s] [ 5%] 2025-12-04T10:01:44.5802308Z inductor/test_torchinductor.py::GPUTests::test_div5_cuda PASSED [0.2692s] [ 5%] 2025-12-04T10:01:44.5802448Z inductor/test_torchinductor.py::GPUTests::test_div6_cuda PASSED [0.2071s] [ 5%] 2025-12-04T10:01:44.5802594Z inductor/test_torchinductor.py::GPUTests::test_div8_cuda PASSED [0.8118s] [ 5%] 2025-12-04T10:01:44.5802733Z inductor/test_torchinductor.py::GPUTests::test_div9_cuda PASSED [0.2267s] [ 6%] 2025-12-04T10:01:44.5802883Z inductor/test_torchinductor.py::GPUTests::test_div_prim_cuda PASSED [0.4628s] [ 6%] 2025-12-04T10:01:44.5803143Z inductor/test_torchinductor.py::GPUTests::test_div_softmax_symfloat_cuda PASSED [1.5717s] [ 6%] 2025-12-04T10:01:44.5803307Z inductor/test_torchinductor.py::GPUTests::test_div_zero_dim_cuda PASSED [1.1308s] [ 6%] 2025-12-04T10:01:44.5803455Z inductor/test_torchinductor.py::GPUTests::test_dropout2_cuda PASSED [0.7148s] [ 7%] 2025-12-04T10:01:44.5804287Z inductor/test_torchinductor.py::GPUTests::test_dropout_deterministic_cuda SKIPPED [0.0005s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/133025 for platform(s) inductor, linux. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 7%] 2025-12-04T10:01:44.5804463Z inductor/test_torchinductor.py::GPUTests::test_dropout_trivial_0_cuda PASSED [0.2001s] [ 7%] 2025-12-04T10:01:44.5804630Z inductor/test_torchinductor.py::GPUTests::test_dtype_sympy_expr_cuda PASSED [1.2744s] [ 8%] 2025-12-04T10:01:44.5804830Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_bfloat16_bfloat16_cuda PASSED [0.4485s] [ 8%] 2025-12-04T10:01:44.5805017Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_bfloat16_float16_cuda PASSED [0.2424s] [ 8%] 2025-12-04T10:01:44.5805205Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_bfloat16_uint8_cuda PASSED [0.2559s] [ 8%] 2025-12-04T10:01:44.5805388Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_float16_cuda PASSED [0.2403s] [ 9%] 2025-12-04T10:01:44.5805574Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_float64_cuda PASSED [0.5302s] [ 9%] 2025-12-04T10:01:44.5805755Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_int32_cuda PASSED [0.3159s] [ 9%] 2025-12-04T10:01:44.5805940Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_int64_cuda PASSED [0.3002s] [ 9%] 2025-12-04T10:01:44.5806121Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_uint8_cuda PASSED [0.2412s] [ 10%] 2025-12-04T10:01:44.5806352Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_float16_cuda PASSED [0.1801s] [ 10%] 2025-12-04T10:01:44.5806534Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_float32_cuda PASSED [0.1693s] [ 10%] 2025-12-04T10:01:44.5806715Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_int32_cuda PASSED [0.1751s] [ 10%] 2025-12-04T10:01:44.5806895Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_int8_cuda PASSED [0.0027s] [ 11%] 2025-12-04T10:01:44.5807077Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_uint8_cuda PASSED [0.0025s] [ 11%] 2025-12-04T10:01:44.5807363Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_float16_cuda PASSED [0.0025s] [ 11%] 2025-12-04T10:01:44.5807540Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_int16_cuda PASSED [0.0025s] [ 11%] 2025-12-04T10:01:44.5807718Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_float16_cuda PASSED [0.2499s] [ 12%] 2025-12-04T10:01:44.5807900Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_float32_cuda PASSED [0.3048s] [ 12%] 2025-12-04T10:01:44.5808098Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int32_cuda PASSED [0.3161s] [ 12%] 2025-12-04T10:01:44.5808281Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int64_cuda PASSED [0.3068s] [ 12%] 2025-12-04T10:01:44.5808466Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int8_cuda PASSED [0.2491s] [ 13%] 2025-12-04T10:01:44.5808651Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_float16_cuda PASSED [0.1797s] [ 13%] 2025-12-04T10:01:44.5808843Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_float32_cuda PASSED [0.1790s] [ 13%] 2025-12-04T10:01:44.5809030Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_float64_cuda PASSED [0.1859s] [ 13%] 2025-12-04T10:01:44.5809216Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_int16_cuda PASSED [0.1841s] [ 14%] 2025-12-04T10:01:44.5809539Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_int32_cuda PASSED [0.1773s] [ 14%] 2025-12-04T10:01:44.5809728Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_int64_cuda PASSED [0.1911s] [ 14%] 2025-12-04T10:01:44.5809911Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_int8_cuda PASSED [0.0025s] [ 14%] 2025-12-04T10:01:44.5810102Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_float32_cuda PASSED [0.1307s] [ 15%] 2025-12-04T10:01:44.5810286Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_float64_cuda PASSED [0.1214s] [ 15%] 2025-12-04T10:01:44.5810475Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_int32_cuda PASSED [0.1219s] [ 15%] 2025-12-04T10:01:44.5810661Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_int64_cuda PASSED [0.1255s] [ 16%] 2025-12-04T10:01:44.5810845Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_int8_cuda PASSED [0.0026s] [ 16%] 2025-12-04T10:01:44.5811035Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_uint8_cuda PASSED [0.0024s] [ 16%] 2025-12-04T10:01:44.5811226Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_bfloat16_cuda PASSED [0.0023s] [ 16%] 2025-12-04T10:01:44.5811419Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_float32_cuda PASSED [0.0023s] [ 17%] 2025-12-04T10:01:44.5811608Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_float64_cuda PASSED [0.0023s] [ 17%] 2025-12-04T10:01:44.5811794Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_int16_cuda PASSED [0.0023s] [ 17%] 2025-12-04T10:01:44.5811980Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_int32_cuda PASSED [0.0023s] [ 17%] 2025-12-04T10:01:44.5812165Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_int64_cuda PASSED [0.0023s] [ 18%] 2025-12-04T10:01:44.5812358Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_bfloat16_cuda PASSED [0.0023s] [ 18%] 2025-12-04T10:01:44.5812552Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_float16_cuda PASSED [0.0023s] [ 18%] 2025-12-04T10:01:44.5812739Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_float32_cuda PASSED [0.0023s] [ 18%] 2025-12-04T10:01:44.5812928Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_float64_cuda PASSED [0.0023s] [ 19%] 2025-12-04T10:01:44.5813112Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_int32_cuda PASSED [0.0024s] [ 19%] 2025-12-04T10:01:44.5813281Z inductor/test_torchinductor.py::GPUTests::test_embedding_cuda PASSED [0.3237s] [ 19%] 2025-12-04T10:01:44.5813527Z inductor/test_torchinductor.py::GPUTests::test_empty1_cuda PASSED [0.0679s] [ 19%] 2025-12-04T10:01:44.5813688Z inductor/test_torchinductor.py::GPUTests::test_empty2_cuda PASSED [0.0661s] [ 20%] 2025-12-04T10:01:44.5813914Z inductor/test_torchinductor.py::GPUTests::test_emulate_precision_triton_fp_fusion_cuda PASSED [0.3526s] [ 20%] 2025-12-04T10:01:44.5814071Z inductor/test_torchinductor.py::GPUTests::test_erfc_cuda PASSED [0.6825s] [ 20%] 2025-12-04T10:01:44.5814228Z inductor/test_torchinductor.py::GPUTests::test_erfinv_cuda PASSED [0.8905s] [ 20%] 2025-12-04T10:01:44.5814397Z inductor/test_torchinductor.py::GPUTests::test_exact_stride_cuda PASSED [0.3117s] [ 21%] 2025-12-04T10:01:44.5814553Z inductor/test_torchinductor.py::GPUTests::test_expm1_cuda PASSED [1.6876s] [ 21%] 2025-12-04T10:01:44.5814749Z inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_list_cuda PASSED [0.1054s] [ 21%] 2025-12-04T10:01:44.5814984Z inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_no_mutated_tensors_cuda PASSED [0.0189s] [ 21%] 2025-12-04T10:01:44.5815163Z inductor/test_torchinductor.py::GPUTests::test_fft_real_input_cuda PASSED [1.2440s] [ 22%] 2025-12-04T10:01:44.5815320Z inductor/test_torchinductor.py::GPUTests::test_fill2_cuda PASSED [0.2204s] [ 22%] 2025-12-04T10:01:44.5815549Z inductor/test_torchinductor.py::GPUTests::test_flexible_layout_immutable_free_symbols_cuda PASSED [0.0022s] [ 22%] 2025-12-04T10:01:44.5815799Z inductor/test_torchinductor.py::GPUTests::test_flip_cat_cuda PASSED [0.3593s] [ 22%] 2025-12-04T10:01:44.5815948Z inductor/test_torchinductor.py::GPUTests::test_flip_cuda PASSED [0.2598s] [ 23%] 2025-12-04T10:01:44.5816139Z inductor/test_torchinductor.py::GPUTests::test_float_index_expression_cuda PASSED [0.0060s] [ 23%] 2025-12-04T10:01:44.5816332Z inductor/test_torchinductor.py::GPUTests::test_float_repr_dynamic_shapes_cuda PASSED [1.6661s] [ 23%] 2025-12-04T10:01:44.5816492Z inductor/test_torchinductor.py::GPUTests::test_fmin_fmax_cuda PASSED [0.3311s] [ 24%] 2025-12-04T10:01:44.5816681Z inductor/test_torchinductor.py::GPUTests::test_forced_buffer_realize_cuda PASSED [0.1996s] [ 24%] 2025-12-04T10:01:44.5816876Z inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d1_cuda PASSED [0.5906s] [ 24%] 2025-12-04T10:01:44.5817068Z inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d2_cuda PASSED [1.2864s] [ 24%] 2025-12-04T10:01:44.5817258Z inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d3_cuda PASSED [0.2616s] [ 25%] 2025-12-04T10:01:44.5817450Z inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d4_cuda PASSED [0.7418s] [ 25%] 2025-12-04T10:01:44.5817635Z inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d5_cuda PASSED [0.5063s] [ 25%] 2025-12-04T10:01:44.5817825Z inductor/test_torchinductor.py::GPUTests::test_full_like_transposed_cuda PASSED [0.2006s] [ 25%] 2025-12-04T10:01:44.5817987Z inductor/test_torchinductor.py::GPUTests::test_fuse_tiled_cuda PASSED [0.2567s] [ 26%] 2025-12-04T10:01:44.5818150Z inductor/test_torchinductor.py::GPUTests::test_gather2_cuda PASSED [0.0019s] [ 26%] 2025-12-04T10:01:44.5818321Z inductor/test_torchinductor.py::GPUTests::test_gather_scatter_cuda PASSED [0.2038s] [ 26%] 2025-12-04T10:01:44.5818478Z inductor/test_torchinductor.py::GPUTests::test_getitem_cuda PASSED [0.0132s] [ 26%] 2025-12-04T10:01:44.5818680Z inductor/test_torchinductor.py::GPUTests::test_gpu_scalar_with_cpu_tensor_cuda PASSED [0.0097s] [ 27%] 2025-12-04T10:01:44.5818878Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_arange1_cuda PASSED [0.8660s] [ 27%] 2025-12-04T10:01:44.5819091Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_constant_tensor2_cuda PASSED [0.1053s] [ 27%] 2025-12-04T10:01:44.5819309Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_misaligned_input_cuda PASSED [0.7724s] [ 27%] 2025-12-04T10:01:44.5819505Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_no_inputs_cuda PASSED [0.4630s] [ 28%] 2025-12-04T10:01:44.5819835Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_unbacked_symint_as_output_cuda PASSED [0.2386s] [ 28%] 2025-12-04T10:01:44.5820003Z inductor/test_torchinductor.py::GPUTests::test_hardsigmoid_cuda PASSED [0.3453s] [ 28%] 2025-12-04T10:01:44.5820166Z inductor/test_torchinductor.py::GPUTests::test_hardswish_cuda PASSED [0.3713s] [ 28%] 2025-12-04T10:01:44.5820352Z inductor/test_torchinductor.py::GPUTests::test_horizonal_fusion2_cuda PASSED [0.2552s] [ 29%] 2025-12-04T10:01:44.5820508Z inductor/test_torchinductor.py::GPUTests::test_index1_cuda PASSED [0.5479s] [ 29%] 2025-12-04T10:01:44.5820693Z inductor/test_torchinductor.py::GPUTests::test_index_dynamic_shapes_cuda PASSED [0.6161s] [ 29%] 2025-12-04T10:01:44.5820877Z inductor/test_torchinductor.py::GPUTests::test_index_propagation_abs_cuda PASSED [0.1190s] [ 29%] 2025-12-04T10:01:44.5821109Z inductor/test_torchinductor.py::GPUTests::test_index_propagation_device_assert_masked_cuda PASSED [0.2613s] [ 30%] 2025-12-04T10:01:44.5821354Z inductor/test_torchinductor.py::GPUTests::test_index_propagation_nested_indirect_indexing_cuda PASSED [0.2102s] [ 30%] 2025-12-04T10:01:44.5821520Z inductor/test_torchinductor.py::GPUTests::test_index_put1_cuda PASSED [1.0634s] [ 30%] 2025-12-04T10:01:44.5821678Z inductor/test_torchinductor.py::GPUTests::test_index_put2_cuda PASSED [0.4297s] [ 31%] 2025-12-04T10:01:44.5821931Z inductor/test_torchinductor.py::GPUTests::test_index_put4_cuda PASSED [0.2219s] [ 31%] 2025-12-04T10:01:44.5822145Z inductor/test_torchinductor.py::GPUTests::test_index_put_deterministic_fallback_cuda PASSED [0.1083s] [ 31%] 2025-12-04T10:01:44.5822324Z inductor/test_torchinductor.py::GPUTests::test_index_put_index_cuda PASSED [0.2315s] [ 31%] 2025-12-04T10:01:44.5822510Z inductor/test_torchinductor.py::GPUTests::test_index_put_reinplace_cuda PASSED [0.2018s] [ 32%] 2025-12-04T10:01:44.5822686Z inductor/test_torchinductor.py::GPUTests::test_index_remainder_cuda PASSED [0.2040s] [ 32%] 2025-12-04T10:01:44.5822854Z inductor/test_torchinductor.py::GPUTests::test_index_select_cuda PASSED [0.6123s] [ 32%] 2025-12-04T10:01:44.5823078Z inductor/test_torchinductor.py::GPUTests::test_inductor_multiple_specializations_cuda PASSED [2.8568s] [ 32%] 2025-12-04T10:01:44.5823233Z inductor/test_torchinductor.py::GPUTests::test_inf_cuda PASSED [0.2062s] [ 33%] 2025-12-04T10:01:44.5823430Z inductor/test_torchinductor.py::GPUTests::test_inner_fn_str_and_stride_cuda PASSED [0.1004s] [ 33%] 2025-12-04T10:01:44.5823634Z inductor/test_torchinductor.py::GPUTests::test_inner_reduction_detection_cuda PASSED [0.1659s] [ 33%] 2025-12-04T10:01:44.5823818Z inductor/test_torchinductor.py::GPUTests::test_inplace_activations_cuda PASSED [0.5195s] [ 33%] 2025-12-04T10:01:44.5823986Z inductor/test_torchinductor.py::GPUTests::test_inplace_add_cuda PASSED [0.1033s] [ 34%] 2025-12-04T10:01:44.5824151Z inductor/test_torchinductor.py::GPUTests::test_inplace_flip_cuda PASSED [0.9500s] [ 34%] 2025-12-04T10:01:44.5824329Z inductor/test_torchinductor.py::GPUTests::test_input_mutation1_cuda PASSED [0.1198s] [ 34%] 2025-12-04T10:01:44.5824499Z inductor/test_torchinductor.py::GPUTests::test_input_mutation3_cuda PASSED [0.1474s] [ 34%] 2025-12-04T10:01:44.5824675Z inductor/test_torchinductor.py::GPUTests::test_input_mutation4_cuda PASSED [0.0868s] [ 35%] 2025-12-04T10:01:44.5824843Z inductor/test_torchinductor.py::GPUTests::test_input_mutation5_cuda PASSED [0.0876s] [ 35%] 2025-12-04T10:01:44.5825040Z inductor/test_torchinductor.py::GPUTests::test_insignificant_strides_cuda PASSED [0.0920s] [ 35%] 2025-12-04T10:01:44.5825193Z inductor/test_torchinductor.py::GPUTests::test_isinf_cuda PASSED [0.4115s] [ 35%] 2025-12-04T10:01:44.5825360Z inductor/test_torchinductor.py::GPUTests::test_issue102546_cuda PASSED [0.1551s] [ 36%] 2025-12-04T10:01:44.5825526Z inductor/test_torchinductor.py::GPUTests::test_kernel_names_cuda PASSED [0.0820s] [ 36%] 2025-12-04T10:01:44.5825770Z inductor/test_torchinductor.py::GPUTests::test_kwargs_cuda SKIPPED [0.0019s] (histogramdd only supports cpu) [ 36%] 2025-12-04T10:01:44.5825927Z inductor/test_torchinductor.py::GPUTests::test_l1_loss_cuda PASSED [0.2384s] [ 36%] 2025-12-04T10:01:44.5826225Z inductor/test_torchinductor.py::GPUTests::test_large_broadcast_reduction_cuda PASSED [0.1973s] [ 37%] 2025-12-04T10:01:44.5826425Z inductor/test_torchinductor.py::GPUTests::test_large_grid_use_block_ptr_True_cuda PASSED [0.4890s] [ 37%] 2025-12-04T10:01:44.5826597Z inductor/test_torchinductor.py::GPUTests::test_large_pointwise_cuda PASSED [0.3946s] [ 37%] 2025-12-04T10:01:44.5826777Z inductor/test_torchinductor.py::GPUTests::test_large_strided_reduction_cuda PASSED [0.7312s] [ 37%] 2025-12-04T10:01:44.5826959Z inductor/test_torchinductor.py::GPUTests::test_large_tensor_reduction_cuda PASSED [0.4235s] [ 38%] 2025-12-04T10:01:44.5827115Z inductor/test_torchinductor.py::GPUTests::test_leaky_relu_cuda PASSED [0.2784s] [ 38%] 2025-12-04T10:01:44.5827262Z inductor/test_torchinductor.py::GPUTests::test_lgamma_cuda PASSED [1.2466s] [ 38%] 2025-12-04T10:01:44.5827421Z inductor/test_torchinductor.py::GPUTests::test_like_rands3_cuda PASSED [0.6039s] [ 39%] 2025-12-04T10:01:44.5827594Z inductor/test_torchinductor.py::GPUTests::test_like_rands_sliced_cuda PASSED [0.1620s] [ 39%] 2025-12-04T10:01:44.5827796Z inductor/test_torchinductor.py::GPUTests::test_linalg_eig_stride_consistency_cuda PASSED [0.1099s] [ 39%] 2025-12-04T10:01:44.5828025Z inductor/test_torchinductor.py::GPUTests::test_linear1_cuda PASSED [0.9984s] [ 39%] 2025-12-04T10:01:44.5828178Z inductor/test_torchinductor.py::GPUTests::test_linear2_cuda PASSED [0.8263s] [ 40%] 2025-12-04T10:01:44.5828349Z inductor/test_torchinductor.py::GPUTests::test_linear_mixed_dtype_cuda PASSED [0.1667s] [ 40%] 2025-12-04T10:01:44.5828507Z inductor/test_torchinductor.py::GPUTests::test_linspace1_cuda PASSED [0.2625s] [ 40%] 2025-12-04T10:01:44.5828657Z inductor/test_torchinductor.py::GPUTests::test_linspace2_cuda PASSED [0.1821s] [ 40%] 2025-12-04T10:01:44.5828894Z inductor/test_torchinductor.py::GPUTests::test_linspace4_cuda SKIPPED [0.0002s] (requires multiple cuda devices) [ 41%] 2025-12-04T10:01:44.5829066Z inductor/test_torchinductor.py::GPUTests::test_lite_mode_fallback_cuda PASSED [0.0655s] [ 41%] 2025-12-04T10:01:44.5829284Z inductor/test_torchinductor.py::GPUTests::test_lite_regional_compile_flex_attention_cuda PASSED [2.9307s] [ 41%] 2025-12-04T10:01:44.5829499Z inductor/test_torchinductor.py::GPUTests::test_lite_regional_compile_repeated_blocks_cuda PASSED [0.1668s] [ 41%] 2025-12-04T10:01:44.5829718Z inductor/test_torchinductor.py::GPUTests::test_lite_triton_kernel_wrapper_functional_cuda PASSED [0.2953s] [ 42%] 2025-12-04T10:01:44.5829866Z inductor/test_torchinductor.py::GPUTests::test_log_fp64_cuda PASSED [0.2104s] [ 42%] 2025-12-04T10:01:44.5830027Z inductor/test_torchinductor.py::GPUTests::test_log_softmax_cuda PASSED [0.9730s] [ 42%] 2025-12-04T10:01:44.5830239Z inductor/test_torchinductor.py::GPUTests::test_logaddexp_cuda SKIPPED [0.0020s] (Not implemented for CUDA) [ 42%] 2025-12-04T10:01:44.5830404Z inductor/test_torchinductor.py::GPUTests::test_logcumsumexp_cuda PASSED [174.7067s] [ 43%] 2025-12-04T10:01:44.5830591Z inductor/test_torchinductor.py::GPUTests::test_logcumsumexp_zero_dim_cuda PASSED [0.1864s] [ 43%] 2025-12-04T10:01:44.5830797Z inductor/test_torchinductor.py::GPUTests::test_low_memory_max_pool_dilation_2_dim_2_cuda PASSED [1.2479s] [ 43%] 2025-12-04T10:01:44.5831006Z inductor/test_torchinductor.py::GPUTests::test_low_memory_max_pool_dilation_2_dim_3_cuda PASSED [3.4469s] [ 43%] 2025-12-04T10:01:44.5831164Z inductor/test_torchinductor.py::GPUTests::test_masked_scatter_cuda PASSED [0.4247s] [ 44%] 2025-12-04T10:01:44.5831338Z inductor/test_torchinductor.py::GPUTests::test_matmul_layer_norm_cuda PASSED [0.3209s] [ 44%] 2025-12-04T10:01:44.5831484Z inductor/test_torchinductor.py::GPUTests::test_max_min_cuda PASSED [0.3203s] [ 44%] 2025-12-04T10:01:44.5831641Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d3_cuda PASSED [1.1834s] [ 44%] 2025-12-04T10:01:44.5831791Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d5_cuda PASSED [0.9374s] [ 45%] 2025-12-04T10:01:44.5832090Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward2_cuda PASSED [2.4146s] [ 45%] 2025-12-04T10:01:44.5832295Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward3_cuda PASSED [1.1821s] [ 45%] 2025-12-04T10:01:44.5832495Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward4_cuda PASSED [5.6926s] [ 45%] 2025-12-04T10:01:44.5832700Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward6_cuda PASSED [0.1515s] [ 46%] 2025-12-04T10:01:44.5832878Z inductor/test_torchinductor.py::GPUTests::test_min_max_reduction_nan_cuda PASSED [0.2345s] [ 46%] 2025-12-04T10:01:44.5833047Z inductor/test_torchinductor.py::GPUTests::test_mix_device_index_cuda PASSED [0.1429s] [ 46%] 2025-12-04T10:01:44.5833197Z inductor/test_torchinductor.py::GPUTests::test_mixed_mm2_cuda PASSED [0.2577s] [ 47%] 2025-12-04T10:01:44.5833351Z inductor/test_torchinductor.py::GPUTests::test_mixed_mm3_cuda PASSED [0.2154s] [ 47%] 2025-12-04T10:01:44.5833500Z inductor/test_torchinductor.py::GPUTests::test_mixed_mm_cuda PASSED [0.2048s] [ 47%] 2025-12-04T10:01:44.5833665Z inductor/test_torchinductor.py::GPUTests::test_mm_mixed_dtype_cuda PASSED [0.0668s] [ 47%] 2025-12-04T10:01:44.5833811Z inductor/test_torchinductor.py::GPUTests::test_mm_views_cuda PASSED [0.0767s] [ 48%] 2025-12-04T10:01:44.5834058Z inductor/test_torchinductor.py::GPUTests::test_mul_index_expr_cuda PASSED [0.1807s] [ 48%] 2025-12-04T10:01:44.5834233Z inductor/test_torchinductor.py::GPUTests::test_mul_softmax_symfloat_cuda PASSED [1.5084s] [ 48%] 2025-12-04T10:01:44.5834479Z inductor/test_torchinductor.py::GPUTests::test_multi_gpu_device_cuda SKIPPED [0.0002s] (requires multiple cuda devices) [ 48%] 2025-12-04T10:01:44.5834746Z inductor/test_torchinductor.py::GPUTests::test_multi_gpu_recompile_on_index_cuda SKIPPED [0.0001s] (requires multiple cuda devices) [ 49%] 2025-12-04T10:01:44.5834909Z inductor/test_torchinductor.py::GPUTests::test_multilayer_var_cuda PASSED [0.9602s] [ 49%] 2025-12-04T10:01:44.5835087Z inductor/test_torchinductor.py::GPUTests::test_multilayer_var_lowp_cuda PASSED [0.7523s] [ 49%] 2025-12-04T10:01:44.5835292Z inductor/test_torchinductor.py::GPUTests::test_mutable_custom_op_fixed_layout2_cuda PASSED [0.1903s] [ 49%] 2025-12-04T10:01:44.5835488Z inductor/test_torchinductor.py::GPUTests::test_mutable_custom_op_fixed_layout_cuda PASSED [0.4429s] [ 50%] 2025-12-04T10:01:44.5835709Z inductor/test_torchinductor.py::GPUTests::test_nan_sort_stable_False_descending_False_cuda PASSED [0.3822s] [ 50%] 2025-12-04T10:01:44.5835917Z inductor/test_torchinductor.py::GPUTests::test_nan_sort_stable_False_descending_True_cuda PASSED [0.3776s] [ 50%] 2025-12-04T10:01:44.5836075Z inductor/test_torchinductor.py::GPUTests::test_nan_to_num_cuda PASSED [0.2333s] [ 50%] 2025-12-04T10:01:44.5836227Z inductor/test_torchinductor.py::GPUTests::test_neg_index_cuda PASSED [0.9339s] [ 51%] 2025-12-04T10:01:44.5836387Z inductor/test_torchinductor.py::GPUTests::test_neg_max_uint8_cuda PASSED [0.1543s] [ 51%] 2025-12-04T10:01:44.5836562Z inductor/test_torchinductor.py::GPUTests::test_nll_loss_backward_cuda PASSED [0.4258s] [ 51%] 2025-12-04T10:01:44.5836729Z inductor/test_torchinductor.py::GPUTests::test_nll_loss_forward_cuda PASSED [0.4744s] [ 51%] 2025-12-04T10:01:44.5836926Z inductor/test_torchinductor.py::GPUTests::test_no_mega_fusion_during_lowering_cuda PASSED [0.4494s] [ 52%] 2025-12-04T10:01:44.5837093Z inductor/test_torchinductor.py::GPUTests::test_no_op_reduction_cuda PASSED [0.2112s] [ 52%] 2025-12-04T10:01:44.5837305Z inductor/test_torchinductor.py::GPUTests::test_no_specization_over_symbolic_value_cuda PASSED [0.4407s] [ 52%] 2025-12-04T10:01:44.5837500Z inductor/test_torchinductor.py::GPUTests::test_nonzero_unbacked_refinement_cuda PASSED [0.9491s] [ 52%] 2025-12-04T10:01:44.5837685Z inductor/test_torchinductor.py::GPUTests::test_norm_constant_overflow_cuda PASSED [0.3735s] [ 53%] 2025-12-04T10:01:44.5837846Z inductor/test_torchinductor.py::GPUTests::test_output_strides_cuda PASSED [0.2534s] [ 53%] 2025-12-04T10:01:44.5838124Z inductor/test_torchinductor.py::GPUTests::test_pattern_matcher_multi_user_cuda PASSED [1.0291s] [ 53%] 2025-12-04T10:01:44.5838311Z inductor/test_torchinductor.py::GPUTests::test_pattern_matcher_unbacked_cuda PASSED [0.2209s] [ 54%] 2025-12-04T10:01:44.5838466Z inductor/test_torchinductor.py::GPUTests::test_permute1_cuda PASSED [0.2534s] [ 54%] 2025-12-04T10:01:44.5838612Z inductor/test_torchinductor.py::GPUTests::test_permute2_cuda PASSED [0.1349s] [ 54%] 2025-12-04T10:01:44.5838791Z inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_j0_cuda PASSED [0.4797s] [ 54%] 2025-12-04T10:01:44.5838963Z inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_j1_cuda PASSED [0.3815s] [ 55%] 2025-12-04T10:01:44.5839176Z inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_u_cuda PASSED [0.4898s] [ 55%] 2025-12-04T10:01:44.5839382Z inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_v_cuda PASSED [0.4473s] [ 55%] 2025-12-04T10:01:44.5839586Z inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_w_cuda PASSED [0.4466s] [ 55%] 2025-12-04T10:01:44.5839760Z inductor/test_torchinductor.py::GPUTests::test_pointwise_digamma_cuda PASSED [0.7320s] [ 56%] 2025-12-04T10:01:44.5839920Z inductor/test_torchinductor.py::GPUTests::test_pointwise_entr_cuda PASSED [0.7029s] [ 56%] 2025-12-04T10:01:44.5840165Z inductor/test_torchinductor.py::GPUTests::test_pointwise_erfc_cuda PASSED [0.2223s] [ 56%] 2025-12-04T10:01:44.5840331Z inductor/test_torchinductor.py::GPUTests::test_pointwise_expit_cuda PASSED [0.2033s] [ 56%] 2025-12-04T10:01:44.5840496Z inductor/test_torchinductor.py::GPUTests::test_pointwise_expm1_cuda PASSED [0.2182s] [ 57%] 2025-12-04T10:01:44.5840668Z inductor/test_torchinductor.py::GPUTests::test_pointwise_gammainc_cuda PASSED [0.0796s] [ 57%] 2025-12-04T10:01:44.5840845Z inductor/test_torchinductor.py::GPUTests::test_pointwise_gammaincc_cuda PASSED [0.0685s] [ 57%] 2025-12-04T10:01:44.5841048Z inductor/test_torchinductor.py::GPUTests::test_pointwise_hermite_polynomial_he_cuda PASSED [0.2842s] [ 57%] 2025-12-04T10:01:44.5841212Z inductor/test_torchinductor.py::GPUTests::test_pointwise_i0_cuda PASSED [0.7330s] [ 58%] 2025-12-04T10:01:44.5841371Z inductor/test_torchinductor.py::GPUTests::test_pointwise_i0e_cuda PASSED [0.6145s] [ 58%] 2025-12-04T10:01:44.5841539Z inductor/test_torchinductor.py::GPUTests::test_pointwise_i1e_cuda PASSED [0.2876s] [ 58%] 2025-12-04T10:01:44.5841704Z inductor/test_torchinductor.py::GPUTests::test_pointwise_logit_cuda PASSED [0.2571s] [ 58%] 2025-12-04T10:01:44.5841901Z inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_k0_cuda PASSED [0.3380s] [ 59%] 2025-12-04T10:01:44.5842062Z inductor/test_torchinductor.py::GPUTests::test_pointwise_ndtr_cuda PASSED [0.2479s] [ 59%] 2025-12-04T10:01:44.5842240Z inductor/test_torchinductor.py::GPUTests::test_pointwise_polygamma_cuda PASSED [0.7247s] [ 59%] 2025-12-04T10:01:44.5842460Z inductor/test_torchinductor.py::GPUTests::test_pointwise_scaled_modified_bessel_k1_cuda PASSED [0.3257s] [ 59%] 2025-12-04T10:01:44.5842685Z inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_t_cuda PASSED [0.3523s] [ 60%] 2025-12-04T10:01:44.5842909Z inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_u_cuda PASSED [0.4935s] [ 60%] 2025-12-04T10:01:44.5843073Z inductor/test_torchinductor.py::GPUTests::test_pointwise_sinc_cuda PASSED [0.8537s] [ 60%] 2025-12-04T10:01:44.5843247Z inductor/test_torchinductor.py::GPUTests::test_pointwise_xlog1py_cuda PASSED [0.2831s] [ 60%] 2025-12-04T10:01:44.5843410Z inductor/test_torchinductor.py::GPUTests::test_pointwise_xlogy_cuda PASSED [0.2662s] [ 61%] 2025-12-04T10:01:44.5843564Z inductor/test_torchinductor.py::GPUTests::test_polar_cuda PASSED [0.3328s] [ 61%] 2025-12-04T10:01:44.5843709Z inductor/test_torchinductor.py::GPUTests::test_pow1_cuda PASSED [0.7045s] [ 61%] 2025-12-04T10:01:44.5843855Z inductor/test_torchinductor.py::GPUTests::test_pow2_cuda PASSED [0.5349s] [ 62%] 2025-12-04T10:01:44.5844081Z inductor/test_torchinductor.py::GPUTests::test_pow3_cuda PASSED [0.1078s] [ 62%] 2025-12-04T10:01:44.5844238Z inductor/test_torchinductor.py::GPUTests::test_pow_int_cuda PASSED [0.8206s] [ 62%] 2025-12-04T10:01:44.5844401Z inductor/test_torchinductor.py::GPUTests::test_pow_symfloat_cuda PASSED [0.2105s] [ 62%] 2025-12-04T10:01:44.5844606Z inductor/test_torchinductor.py::GPUTests::test_prepare_softmax_with_fast_math_cuda PASSED [0.8976s] [ 63%] 2025-12-04T10:01:44.5845001Z inductor/test_torchinductor.py::GPUTests::test_profiler_mark_wrapper_call_cuda [W1204 09:58:51.351196013 collection.cpp:1148] Warning: ROCTracer produced duplicate flow start: 20 (function operator()) 2025-12-04T10:01:44.5845065Z PASSED [0.1079s] [ 63%] 2025-12-04T10:01:44.5845212Z inductor/test_torchinductor.py::GPUTests::test_randint_cuda PASSED [0.2700s] [ 63%] 2025-12-04T10:01:44.5845398Z inductor/test_torchinductor.py::GPUTests::test_randint_distribution_cuda PASSED [0.2128s] [ 63%] 2025-12-04T10:01:44.5845573Z inductor/test_torchinductor.py::GPUTests::test_randint_int64_mod_cuda PASSED [0.1598s] [ 64%] 2025-12-04T10:01:44.5845739Z inductor/test_torchinductor.py::GPUTests::test_randn_generator_cuda PASSED [0.3261s] [ 64%] 2025-12-04T10:01:44.5845909Z inductor/test_torchinductor.py::GPUTests::test_randn_like_empty_cuda PASSED [0.1397s] [ 64%] 2025-12-04T10:01:44.5846154Z inductor/test_torchinductor.py::GPUTests::test_reduction1_cuda PASSED [0.3278s] [ 64%] 2025-12-04T10:01:44.5846363Z inductor/test_torchinductor.py::GPUTests::test_reduction2_cuda PASSED [0.5814s] [ 65%] 2025-12-04T10:01:44.5846512Z inductor/test_torchinductor.py::GPUTests::test_reduction3_cuda PASSED [0.2717s] [ 65%] 2025-12-04T10:01:44.5846665Z inductor/test_torchinductor.py::GPUTests::test_reduction5_cuda PASSED [0.2594s] [ 65%] 2025-12-04T10:01:44.5846847Z inductor/test_torchinductor.py::GPUTests::test_reduction_config_limit_cuda PASSED [0.0020s] [ 65%] 2025-12-04T10:01:44.5847042Z inductor/test_torchinductor.py::GPUTests::test_reflection_pad2d_backward_cuda PASSED [3.7553s] [ 66%] 2025-12-04T10:01:44.5847222Z inductor/test_torchinductor.py::GPUTests::test_reinterpret_dtypeview_cuda PASSED [0.1849s] [ 66%] 2025-12-04T10:01:44.5847372Z inductor/test_torchinductor.py::GPUTests::test_relu_cuda PASSED [0.2555s] [ 66%] 2025-12-04T10:01:44.5847537Z inductor/test_torchinductor.py::GPUTests::test_remove_no_ops_cuda PASSED [1.1491s] [ 66%] 2025-12-04T10:01:44.5847708Z inductor/test_torchinductor.py::GPUTests::test_remove_noop_slice_cuda PASSED [0.2153s] [ 67%] 2025-12-04T10:01:44.5848570Z inductor/test_torchinductor.py::GPUTests::test_remove_noop_slice_scatter_cuda SKIPPED [0.0005s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/151378 for platform(s) linux, rocm, slow. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 67%] 2025-12-04T10:01:44.5849420Z inductor/test_torchinductor.py::GPUTests::test_remove_noop_view_default_cuda SKIPPED [0.0004s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/151511 for platform(s) linux, rocm, slow. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 67%] 2025-12-04T10:01:44.5850265Z inductor/test_torchinductor.py::GPUTests::test_remove_noop_view_dtype_cuda SKIPPED [0.0004s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/151541 for platform(s) linux, rocm, slow. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 67%] 2025-12-04T10:01:44.5850451Z inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_2_cuda PASSED [0.1868s] [ 68%] 2025-12-04T10:01:44.5850811Z inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_Tensor_decomp_int32_nd_1_cuda PASSED [0.1456s] [ 68%] 2025-12-04T10:01:44.5851030Z inductor/test_torchinductor.py::GPUTests::test_replication_pad_errors_with_bool_cuda PASSED [0.5854s] [ 68%] 2025-12-04T10:01:44.5851231Z inductor/test_torchinductor.py::GPUTests::test_require_stride_expanded_cuda PASSED [0.2000s] [ 68%] 2025-12-04T10:01:44.5851402Z inductor/test_torchinductor.py::GPUTests::test_resize_as_cuda PASSED [7.3257s] [ 69%] 2025-12-04T10:01:44.5851558Z inductor/test_torchinductor.py::GPUTests::test_resize_cuda PASSED [2.7449s] [ 69%] 2025-12-04T10:01:44.5851764Z inductor/test_torchinductor.py::GPUTests::test_reuse_buffers_with_aliasing_cuda PASSED [0.7710s] [ 69%] 2025-12-04T10:01:44.5851914Z inductor/test_torchinductor.py::GPUTests::test_roll_cuda PASSED [0.4473s] [ 70%] 2025-12-04T10:01:44.5852073Z inductor/test_torchinductor.py::GPUTests::test_rsqrt_cuda PASSED [0.2259s] [ 70%] 2025-12-04T10:01:44.5852243Z inductor/test_torchinductor.py::GPUTests::test_scalar_output_cuda PASSED [0.8773s] [ 70%] 2025-12-04T10:01:44.5852559Z inductor/test_torchinductor.py::GPUTests::test_scaled_dot_product_attention_cuda SKIPPED [0.0019s] (Can't run flash attention on this platform) [ 70%] 2025-12-04T10:01:44.5852718Z inductor/test_torchinductor.py::GPUTests::test_scatter1_cuda PASSED [0.6138s] [ 71%] 2025-12-04T10:01:44.5853027Z inductor/test_torchinductor.py::GPUTests::test_scatter2_cuda SKIPPED [0.0019s] (unstable on sm86) [ 71%] 2025-12-04T10:01:44.5853182Z inductor/test_torchinductor.py::GPUTests::test_scatter3_cuda PASSED [0.2282s] [ 71%] 2025-12-04T10:01:44.5853339Z inductor/test_torchinductor.py::GPUTests::test_scatter4_cuda PASSED [0.4461s] [ 71%] 2025-12-04T10:01:44.5853492Z inductor/test_torchinductor.py::GPUTests::test_scatter5_cuda PASSED [0.5239s] [ 72%] 2025-12-04T10:01:44.5853735Z inductor/test_torchinductor.py::GPUTests::test_scatter_add1_cuda SKIPPED [0.0002s] (Flaky test, needs debugging) [ 72%] 2025-12-04T10:01:44.5853906Z inductor/test_torchinductor.py::GPUTests::test_scatter_add3_cuda PASSED [0.3985s] [ 72%] 2025-12-04T10:01:44.5854086Z inductor/test_torchinductor.py::GPUTests::test_scatter_reduce1_cuda PASSED [0.2398s] [ 72%] 2025-12-04T10:01:44.5854429Z inductor/test_torchinductor.py::GPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_False_cuda SKIPPED [0.0002s] (Does not support SDPA or pre-SM80 hardware) [ 73%] 2025-12-04T10:01:44.5854728Z inductor/test_torchinductor.py::GPUTests::test_sdpa_unaligned_mask_freezing_cuda SKIPPED [0.0001s] (Does not support mem_eff_attention) [ 73%] 2025-12-04T10:01:44.5854900Z inductor/test_torchinductor.py::GPUTests::test_searchsorted_cuda PASSED [6.5875s] [ 73%] 2025-12-04T10:01:44.5855070Z inductor/test_torchinductor.py::GPUTests::test_select_scatter_cuda PASSED [0.3304s] [ 73%] 2025-12-04T10:01:44.5855238Z inductor/test_torchinductor.py::GPUTests::test_sgn_extremal_cuda PASSED [0.2098s] [ 74%] 2025-12-04T10:01:44.5855405Z inductor/test_torchinductor.py::GPUTests::test_shape_padding_cuda PASSED [1.5193s] [ 74%] 2025-12-04T10:01:44.5855606Z inductor/test_torchinductor.py::GPUTests::test_should_pad_bench_for_bmm_cuda PASSED [0.0022s] [ 74%] 2025-12-04T10:01:44.5855762Z inductor/test_torchinductor.py::GPUTests::test_sigmoid_cuda PASSED [0.2666s] [ 74%] 2025-12-04T10:01:44.5855921Z inductor/test_torchinductor.py::GPUTests::test_signbit_cuda PASSED [0.3116s] [ 75%] 2025-12-04T10:01:44.5856075Z inductor/test_torchinductor.py::GPUTests::test_silu_cuda PASSED [0.2021s] [ 75%] 2025-12-04T10:01:44.5856233Z inductor/test_torchinductor.py::GPUTests::test_sin_cuda PASSED [0.5077s] [ 75%] 2025-12-04T10:01:44.5856400Z inductor/test_torchinductor.py::GPUTests::test_single_elem_cuda PASSED [0.1739s] [ 75%] 2025-12-04T10:01:44.5856558Z inductor/test_torchinductor.py::GPUTests::test_slice1_cuda PASSED [0.3136s] [ 76%] 2025-12-04T10:01:44.5856709Z inductor/test_torchinductor.py::GPUTests::test_slice3_cuda PASSED [0.3048s] [ 76%] 2025-12-04T10:01:44.5856862Z inductor/test_torchinductor.py::GPUTests::test_slice4_cuda PASSED [0.1414s] [ 76%] 2025-12-04T10:01:44.5857135Z inductor/test_torchinductor.py::GPUTests::test_slice_mutation1_cuda PASSED [0.3135s] [ 77%] 2025-12-04T10:01:44.5857316Z inductor/test_torchinductor.py::GPUTests::test_slice_mutation2_cuda PASSED [0.1844s] [ 77%] 2025-12-04T10:01:44.5857487Z inductor/test_torchinductor.py::GPUTests::test_slice_mutation3_cuda PASSED [0.1269s] [ 77%] 2025-12-04T10:01:44.5857666Z inductor/test_torchinductor.py::GPUTests::test_slice_scatter2_cuda PASSED [0.2339s] [ 77%] 2025-12-04T10:01:44.5857837Z inductor/test_torchinductor.py::GPUTests::test_slice_scatter4_cuda PASSED [0.2713s] [ 78%] 2025-12-04T10:01:44.5858004Z inductor/test_torchinductor.py::GPUTests::test_slice_scatter5_cuda PASSED [0.2881s] [ 78%] 2025-12-04T10:01:44.5858222Z inductor/test_torchinductor.py::GPUTests::test_slice_scatter_dtype_consistency_cuda PASSED [0.7799s] [ 78%] 2025-12-04T10:01:44.5858416Z inductor/test_torchinductor.py::GPUTests::test_slice_scatter_reinplace_cuda PASSED [0.2263s] [ 78%] 2025-12-04T10:01:44.5858609Z inductor/test_torchinductor.py::GPUTests::test_softmax_backward_data_cuda PASSED [0.6636s] [ 79%] 2025-12-04T10:01:44.5858800Z inductor/test_torchinductor.py::GPUTests::test_softmax_one_kernel_loop_cuda PASSED [0.2010s] [ 79%] 2025-12-04T10:01:44.5858965Z inductor/test_torchinductor.py::GPUTests::test_sort_bool_cuda PASSED [49.4021s] [ 79%] 2025-12-04T10:01:44.5859222Z inductor/test_torchinductor.py::GPUTests::test_sort_transpose_cuda PASSED [35.0890s] [ 79%] 2025-12-04T10:01:44.5859404Z inductor/test_torchinductor.py::GPUTests::test_special_polygamma_cuda PASSED [0.8584s] [ 80%] 2025-12-04T10:01:44.5859558Z inductor/test_torchinductor.py::GPUTests::test_split_cuda PASSED [0.3638s] [ 80%] 2025-12-04T10:01:44.5859752Z inductor/test_torchinductor.py::GPUTests::test_split_cumprod_low_prec_cuda PASSED [0.5773s] [ 80%] 2025-12-04T10:01:44.5859919Z inductor/test_torchinductor.py::GPUTests::test_split_cumsum_cuda PASSED [3.2606s] [ 80%] 2025-12-04T10:01:44.5860101Z inductor/test_torchinductor.py::GPUTests::test_split_cumsum_index_cuda PASSED [1.3563s] [ 81%] 2025-12-04T10:01:44.5860292Z inductor/test_torchinductor.py::GPUTests::test_split_cumsum_low_prec_cuda PASSED [0.2131s] [ 81%] 2025-12-04T10:01:44.5860509Z inductor/test_torchinductor.py::GPUTests::test_split_reduction_dynamic_shape_cuda PASSED [0.2838s] [ 81%] 2025-12-04T10:01:44.5860696Z inductor/test_torchinductor.py::GPUTests::test_split_with_integer_cuda PASSED [0.4736s] [ 81%] 2025-12-04T10:01:44.5860862Z inductor/test_torchinductor.py::GPUTests::test_squeeze2_cuda PASSED [0.2441s] [ 82%] 2025-12-04T10:01:44.5861034Z inductor/test_torchinductor.py::GPUTests::test_squeeze_varargs_cuda PASSED [0.1919s] [ 82%] 2025-12-04T10:01:44.5861189Z inductor/test_torchinductor.py::GPUTests::test_std_cuda PASSED [1.0687s] [ 82%] 2025-12-04T10:01:44.5861454Z inductor/test_torchinductor.py::GPUTests::test_stride_preservation_with_stride_modifying_fx_pass_cuda PASSED [0.0964s] [ 82%] 2025-12-04T10:01:44.5861605Z inductor/test_torchinductor.py::GPUTests::test_sum1_cuda PASSED [0.4005s] [ 83%] 2025-12-04T10:01:44.5861770Z inductor/test_torchinductor.py::GPUTests::test_sum2_cuda PASSED [2.5739s] [ 83%] 2025-12-04T10:01:44.5861921Z inductor/test_torchinductor.py::GPUTests::test_sum3_cuda PASSED [0.5709s] [ 83%] 2025-12-04T10:01:44.5862072Z inductor/test_torchinductor.py::GPUTests::test_sum5_cuda PASSED [2.3203s] [ 83%] 2025-12-04T10:01:44.5862231Z inductor/test_torchinductor.py::GPUTests::test_tensor2_cuda PASSED [0.2041s] [ 84%] 2025-12-04T10:01:44.5862415Z inductor/test_torchinductor.py::GPUTests::test_tensor_index_slice_cuda PASSED [0.4563s] [ 84%] 2025-12-04T10:01:44.5862650Z inductor/test_torchinductor.py::GPUTests::test_tmp_not_defined_issue1_use_block_ptr_True_cuda PASSED [0.5010s] [ 84%] 2025-12-04T10:01:44.5862845Z inductor/test_torchinductor.py::GPUTests::test_tmp_not_defined_issue2_cuda PASSED [0.5518s] [ 85%] 2025-12-04T10:01:44.5863022Z inductor/test_torchinductor.py::GPUTests::test_to_device_constant_cuda PASSED [0.6925s] [ 85%] 2025-12-04T10:01:44.5863277Z inductor/test_torchinductor.py::GPUTests::test_to_device_cuda PASSED [0.1781s] [ 85%] 2025-12-04T10:01:44.5863440Z inductor/test_torchinductor.py::GPUTests::test_to_dtype_cuda PASSED [0.2424s] [ 85%] 2025-12-04T10:01:44.5863624Z inductor/test_torchinductor.py::GPUTests::test_to_memory_format_cuda PASSED [0.5104s] [ 86%] 2025-12-04T10:01:44.5863813Z inductor/test_torchinductor.py::GPUTests::test_transposed_propagates_cuda PASSED [0.1279s] [ 86%] 2025-12-04T10:01:44.5864058Z inductor/test_torchinductor.py::GPUTests::test_triton_argmin_argmax_transpose_logical_index_cuda PASSED [2.4021s] [ 86%] 2025-12-04T10:01:44.5864206Z inductor/test_torchinductor.py::GPUTests::test_triu_cuda PASSED [0.3855s] [ 86%] 2025-12-04T10:01:44.5864385Z inductor/test_torchinductor.py::GPUTests::test_uint4x2_mixed_mm_cuda PASSED [0.4660s] [ 87%] 2025-12-04T10:01:44.5864590Z inductor/test_torchinductor.py::GPUTests::test_unbacked_floordiv_simplify_cuda PASSED [0.5240s] [ 87%] 2025-12-04T10:01:44.5864813Z inductor/test_torchinductor.py::GPUTests::test_unbacked_floordiv_simplify_errors_cuda PASSED [0.0130s] [ 87%] 2025-12-04T10:01:44.5864972Z inductor/test_torchinductor.py::GPUTests::test_unbind_cuda PASSED [0.1625s] [ 87%] 2025-12-04T10:01:44.5865174Z inductor/test_torchinductor.py::GPUTests::test_unfold_zero_dimension_tensor_cuda PASSED [0.0627s] [ 88%] 2025-12-04T10:01:44.5865460Z inductor/test_torchinductor.py::GPUTests::test_unsigned_constant_tensors_cuda PASSED [0.5858s] [ 88%] 2025-12-04T10:01:44.5865653Z inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_bfloat16_cuda PASSED [0.2983s] [ 88%] 2025-12-04T10:01:44.5865840Z inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_float32_cuda PASSED [0.2682s] [ 88%] 2025-12-04T10:01:44.5866022Z inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_float64_cuda PASSED [0.3366s] [ 89%] 2025-12-04T10:01:44.5866209Z inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_int32_cuda PASSED [0.2783s] [ 89%] 2025-12-04T10:01:44.5866398Z inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_uint8_cuda PASSED [0.2614s] [ 89%] 2025-12-04T10:01:44.5866582Z inductor/test_torchinductor.py::GPUTests::test_upsample_bilinear2d_a_cuda PASSED [1.2671s] [ 89%] 2025-12-04T10:01:44.5866758Z inductor/test_torchinductor.py::GPUTests::test_upsample_bilinear2d_b_cuda PASSED [0.7266s] [ 90%] 2025-12-04T10:01:44.5866936Z inductor/test_torchinductor.py::GPUTests::test_upsample_nearest1d_cuda PASSED [0.5000s] [ 90%] 2025-12-04T10:01:44.5867105Z inductor/test_torchinductor.py::GPUTests::test_upsample_nearest2d_cuda PASSED [0.7375s] [ 90%] 2025-12-04T10:01:44.5867281Z inductor/test_torchinductor.py::GPUTests::test_upsample_nearest3d_cuda PASSED [1.4940s] [ 90%] 2025-12-04T10:01:44.5867441Z inductor/test_torchinductor.py::GPUTests::test_var_correction_cuda PASSED [0.7199s] [ 91%] 2025-12-04T10:01:44.5867640Z inductor/test_torchinductor.py::GPUTests::test_var_mean_tile_reduction_False_cuda PASSED [0.5505s] [ 91%] 2025-12-04T10:01:44.5868463Z inductor/test_torchinductor.py::GPUTests::test_var_mean_tile_reduction_True_cuda SKIPPED [0.0005s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/165879 for platform(s) inductor, rocm. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 91%] 2025-12-04T10:01:44.5868647Z inductor/test_torchinductor.py::GPUTests::test_vectorized_ops_masked_cuda PASSED [0.2686s] [ 91%] 2025-12-04T10:01:44.5868852Z inductor/test_torchinductor.py::GPUTests::test_vectorized_ops_masked_var_novec_cuda PASSED [0.2485s] [ 92%] 2025-12-04T10:01:44.5869017Z inductor/test_torchinductor.py::GPUTests::test_view_as_complex_cuda PASSED [0.1463s] [ 92%] 2025-12-04T10:01:44.5869178Z inductor/test_torchinductor.py::GPUTests::test_view_detach_cuda PASSED [0.1310s] [ 92%] 2025-12-04T10:01:44.5869340Z inductor/test_torchinductor.py::GPUTests::test_view_on_aliased_cuda PASSED [0.7445s] [ 93%] 2025-12-04T10:01:44.5869647Z inductor/test_torchinductor.py::GPUTests::test_view_uint8_through_differing_bitwidths_cuda PASSED [0.4243s] [ 93%] 2025-12-04T10:01:44.5869797Z inductor/test_torchinductor.py::GPUTests::test_views4_cuda PASSED [0.8119s] [ 93%] 2025-12-04T10:01:44.5869945Z inductor/test_torchinductor.py::GPUTests::test_views5_cuda PASSED [0.1160s] [ 93%] 2025-12-04T10:01:44.5870092Z inductor/test_torchinductor.py::GPUTests::test_views6_cuda PASSED [0.2122s] [ 94%] 2025-12-04T10:01:44.5870236Z inductor/test_torchinductor.py::GPUTests::test_views7_cuda PASSED [0.2329s] [ 94%] 2025-12-04T10:01:44.5870400Z inductor/test_torchinductor.py::GPUTests::test_weight_norm_bwd_cuda PASSED [0.6615s] [ 94%] 2025-12-04T10:01:44.5870580Z inductor/test_torchinductor.py::GPUTests::test_xblock_divides_xnumel_cuda PASSED [0.3706s] [ 94%] 2025-12-04T10:01:44.5870754Z inductor/test_torchinductor.py::GPUTests::test_zero_dim_reductions_cuda PASSED [0.1710s] [ 95%] 2025-12-04T10:01:44.5870928Z inductor/test_torchinductor.py::GPUTests::test_zero_element_mutation_cuda PASSED [0.1268s] [ 95%] 2025-12-04T10:01:44.5871132Z inductor/test_torchinductor.py::TritonCodeGenTests::test_cant_optimize_compute PASSED [0.4295s] [ 95%] 2025-12-04T10:01:44.5871373Z inductor/test_torchinductor.py::TritonCodeGenTests::test_ctr_not_moved_to_cuda_when_used_in_index_put PASSED [0.4411s] [ 95%] 2025-12-04T10:01:44.5871656Z inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace PASSED [0.5858s] [ 96%] 2025-12-04T10:01:44.5871890Z inductor/test_torchinductor.py::TritonCodeGenTests::test_evict_last_non_coalesced_loads_block_ptr PASSED [4.1575s] [ 96%] 2025-12-04T10:01:44.5872130Z inductor/test_torchinductor.py::TritonCodeGenTests::test_graph_partition_default_device_context PASSED [0.1159s] [ 96%] 2025-12-04T10:01:44.5872354Z inductor/test_torchinductor.py::TritonCodeGenTests::test_grouped_mm SKIPPED [0.0002s] (no grouped_mm support) [ 96%] 2025-12-04T10:01:44.5872661Z inductor/test_torchinductor.py::TritonCodeGenTests::test_has_constant_mask_block_multiple_False_ynumel_exceed_ygrid_size_False PASSED [0.1242s] [ 97%] 2025-12-04T10:01:44.5872956Z inductor/test_torchinductor.py::TritonCodeGenTests::test_has_constant_mask_block_multiple_True_ynumel_exceed_ygrid_size_False PASSED [0.1310s] [ 97%] 2025-12-04T10:01:44.5873249Z inductor/test_torchinductor.py::TritonCodeGenTests::test_has_constant_mask_block_multiple_True_ynumel_exceed_ygrid_size_True PASSED [0.1502s] [ 97%] 2025-12-04T10:01:44.5873487Z inductor/test_torchinductor.py::TritonCodeGenTests::test_inductor_detach_view_backend_inductor PASSED [0.1371s] [ 97%] 2025-12-04T10:01:44.5873681Z inductor/test_torchinductor.py::TritonCodeGenTests::test_inductor_sequence_nr PASSED [2.0533s] [ 98%] 2025-12-04T10:01:44.5873910Z inductor/test_torchinductor.py::TritonCodeGenTests::test_not_materialize_pointwise_reduction PASSED [0.4606s] [ 98%] 2025-12-04T10:01:44.5874086Z inductor/test_torchinductor.py::TritonCodeGenTests::test_numpy_on_gpu PASSED [0.1676s] [ 98%] 2025-12-04T10:01:44.5874276Z inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_compute PASSED [0.8806s] [ 98%] 2025-12-04T10:01:44.5874483Z inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_assert PASSED [1.2377s] [ 99%] 2025-12-04T10:01:44.5874724Z inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_dtype_with_constraint PASSED [0.4021s] [ 99%] 2025-12-04T10:01:44.5874883Z inductor/test_torchinductor.py::RNNTest::test_rnn_compile_safe PASSED [0.7680s] [ 99%] 2025-12-04T10:01:44.5875063Z inductor/test_torchinductor.py::NanCheckerTest::test_nan_checker_pass PASSED [0.1866s] [100%] 2025-12-04T10:01:44.5875066Z 2025-12-04T10:01:44.5875423Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-ea2ab4f276b6917a.xml - 2025-12-04T10:01:44.5875549Z ========= 371 passed, 16 skipped, 137 deselected in 460.44s (0:07:40) ========== 2025-12-04T10:01:44.5875839Z The following tests failed consistently: ['test/inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda'] 2025-12-04T10:01:44.5875929Z 2025-12-04T10:01:44.5876211Z FINISHED PRINTING LOG FILE of inductor/test_torchinductor 2/2 (test/test-reports/inductor.test_torchinductor_2.2_61b015180443276d_.log) 2025-12-04T10:01:44.5876214Z 2025-12-04T10:01:44.5876400Z Finished inductor/test_torchinductor 2/2 ... [2025-12-04 10:01:44.489270][212195.74864886], took 10.09min 2025-12-04T10:01:44.5876780Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-ea2ab4f276b6917a.xml 2025-12-04T10:01:44.5876917Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:01:44.5877060Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T10:01:44.5877131Z Uploading artifacts took 0.00 seconds 2025-12-04T10:01:44.5877204Z inductor/test_torchinductor 2/2 failed! 2025-12-04T10:01:44.5877363Z Running inductor/test_flex_attention 1/6 ... [2025-12-04 10:01:44.499635][212195.759016507] 2025-12-04T10:01:44.5877434Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:01:44.5877916Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_flex_attention.py', '--shard-id=1', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:01:44.499840] 2025-12-04T10:09:45.7564907Z 2025-12-04T10:09:45.7566156Z inductor/test_flex_attention 1/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_flex_attention_1.6_6b97ff755e0d95c8_.log 2025-12-04T10:09:45.7639187Z Running 129 items in this shard: test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_automatic_dynamic_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_automatic_dynamic_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE_256_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE_256_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE_256_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_seqlen_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_seqlen_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_seqlen_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_seqlen_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_dynamic_score_mask_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_dynamic_score_mask_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_custom_sparse_block_size_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_custom_sparse_block_size_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_default_sparse_block_size_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_default_sparse_block_size_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_captured_reduction_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_causal_block_paged_attention_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_device_cuda_1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_dynamic_divisibility_guards_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_eager_permute_order3_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_inductor_permute_order1_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_inductor_permute_order4_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_eager_permute_order0_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_inductor_permute_order3_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_inductor_permute_order4_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_paged_attention_permute_order0_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_force_write_lse_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_fw_bw_graph_correctness_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_inputs_are_realized_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims1_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims0_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims0_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims1_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims1_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims1_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims0_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims1_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims0_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims1_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims0_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims1_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims1_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_large_batch_heads_grid_dimension_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_load_from_bias_head_seq_batch_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_load_from_bias_seq_batch_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_logsumexp_correctness_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_lse_masked_output_backend_flex_attention_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_mask_mod_combiners_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_mask_mod_handles_symint_addition_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_multiple_mask_calls_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_natten_2d_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod0_head_dims0_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod2_head_dims0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod3_head_dims0_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod4_head_dims0_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod4_head_dims1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod5_head_dims0_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod6_head_dims1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod7_head_dims0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod7_head_dims1_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod7_head_dims1_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_pow_2_headdim_head_dim_17_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_aux__squared_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_max__identity_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_max__rel_bias_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s0_v_s0_do_s0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s0_v_s0_do_s1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s1_v_s1_do_s0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s2_v_s2_do_s2_cuda_float16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_allocate_cuda, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod2_cuda_float32, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod3_cuda_bfloat16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod3_cuda_float32, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod4_cuda_bfloat16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod6_cuda_float32, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_backward_error_with_none_q_indices_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_mask_device_change_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_mask_viz_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_size_changes_BLOCK_SIZE_32_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_size_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_broadcasted_head_block_mask_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_flex_attention_poison_mod_fwd_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_from_kv_blocks_without_q_computation_full_indices_True_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_init_mismatched_full_kv_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_absolute_2d_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_absolute_2d_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_absolute_2d_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_batch_head_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_batch_head_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_distinct_biases_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_distinct_biases_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flex_attention_logging_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flex_attention_with_dynamic_max_autotune_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flipped_indexed_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flipped_indexed_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flipped_indexed_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flipped_indexed_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_global_tokens_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_indirect_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_indirect_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_indirect_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_indirect_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_indirect_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_learnable_bias_global_compiled_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_learnable_bias_global_compiled_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_local_window_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_local_window_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_local_window_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_local_window_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_weird_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda 2025-12-04T10:09:45.7734300Z 2025-12-04T10:09:45.7734557Z Finished inductor/test_flex_attention 1/6 ... [2025-12-04 10:09:45.756411][212677.015786233], took 8.02min 2025-12-04T10:09:45.7735430Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-ea2ab4f276b6917a.xml 2025-12-04T10:09:45.7736202Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:09:45.7736906Z Running inductor/test_flex_attention 3/6 ... [2025-12-04 10:09:45.771368][212677.030746807] 2025-12-04T10:09:45.7737631Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:09:45.7739154Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_flex_attention.py', '--shard-id=3', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:09:45.771819] 2025-12-04T10:17:38.7765252Z 2025-12-04T10:17:38.7766665Z inductor/test_flex_attention 3/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_flex_attention_3.6_83bc3fac28bd12fe_.log 2025-12-04T10:17:38.7822620Z Running 110 items in this shard: test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_aot_eager_gradcheck_score_mod0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_backend_auto_matches_triton_large_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_automatic_dynamic_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE_256_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE_256_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE_256_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE3_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE3_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_seqlen_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_seqlen_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_dynamic_score_mask_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod3_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod6_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_doc_mask_sparse_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_dynamic_captured_buffer_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_dynamic_shapes_with_max_autotune_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_eager_permute_order0_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_inductor_permute_order4_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_eager_permute_order1_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_eager_permute_order2_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_eager_permute_order2_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_inductor_permute_order0_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_inductor_permute_order1_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_inductor_permute_order1_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_paged_attention_permute_order1_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_paged_attention_permute_order1_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_paged_attention_permute_order2_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_float32_matmul_precision_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_fully_masked_out_rows_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims0_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims0_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims0_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims0_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims1_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims1_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims1_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims0_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims0_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims1_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims0_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims0_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_load_from_bias_seq_only_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_load_rel_bias_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_make_block_mask_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_max_autotune_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_multiple_score_mod_calls_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_contiguous_last_dim_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod0_head_dims1_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod0_head_dims1_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod1_head_dims1_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod2_head_dims1_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod4_head_dims0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod5_head_dims0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod6_head_dims0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod7_head_dims0_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_recompile_changed_score_mod_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_reduction_unrolled_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_max__inverse_causal_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_selective_ac_ops_to_save1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_skip_odd_keys_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s1_v_s1_do_s1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s0_v_s0_do_s0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s3_v_s3_do_s1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_zero_length_sequence_error_cuda, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_convert_mask_mod_cuda, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod5_cuda_bfloat16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod6_cuda_bfloat16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod7_cuda_bfloat16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_size_changes_BLOCK_SIZE5_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_create_is_cuda_graphable_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_from_kv_blocks_full_indices_True_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_sliced_blockmask_mask_mod_error_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_absolute_2d_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_absolute_2d_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_absolute_2d_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_distinct_biases_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flex_attention_with_dynamic_max_autotune_graph_partition_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flipped_indexed_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flipped_indexed_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_bias_req_grad_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_indirect_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_inspect_bug_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_learnable_bias_global_compiled_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_local_window_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_multiplicative_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_mode_max-autotune-no-cudagraphs_cuda 2025-12-04T10:17:38.7898240Z 2025-12-04T10:17:38.7898708Z Finished inductor/test_flex_attention 3/6 ... [2025-12-04 10:17:38.776480][213150.035855975], took 7.88min 2025-12-04T10:17:38.7900298Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-ea2ab4f276b6917a.xml 2025-12-04T10:17:38.7974024Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:17:38.7978918Z Running inductor/test_flex_attention 5/6 ... [2025-12-04 10:17:38.797536][213150.056913993] 2025-12-04T10:17:38.7979715Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:17:38.7984695Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_flex_attention.py', '--shard-id=5', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:17:38.798058] 2025-12-04T10:26:10.1604328Z 2025-12-04T10:26:10.1605509Z inductor/test_flex_attention 5/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_flex_attention_5.6_f707ded96caf3521_.log 2025-12-04T10:26:10.1648821Z Running 137 items in this shard: test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_backend_triton_decode_errors_when_not_supported_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_backend_triton_decode_matches_auto_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_automatic_dynamic_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE_256_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE_256_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE3_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE3_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE3_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE_256_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_seqlen_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod1_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod1_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod2_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod3_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod4_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod4_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_custom_sparse_block_size_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_default_sparse_block_size_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_default_sparse_block_size_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_captured_buffers_all_dims_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_captured_scale_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_causal_block_non_divisible_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_custom_block_mask_generator_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_differentiable_logsumexp_compiled_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_differentiable_logsumexp_gradcheck_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_dynamic_shapes_with_custom_kernel_options_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_eager_permute_order1_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_eager_permute_order2_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_eager_permute_order2_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_eager_permute_order4_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_eager_permute_order4_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_inductor_permute_order2_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_eager_permute_order1_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_eager_permute_order3_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_eager_permute_order4_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_inductor_permute_order2_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_inductor_permute_order2_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_inductor_permute_order3_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_index_multiple_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_index_weird2_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_invalid_block_size_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims0_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims0_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims0_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims1_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims1_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims1_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims0_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims0_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims1_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims0_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims0_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims1_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims1_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims0_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims0_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims1_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims1_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_logsumexp_correctness_score_mod0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_multiple_score_mod_calls2_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_divisible_with_captured_buffer_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod0_head_dims0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod2_head_dims0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod5_head_dims1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod5_head_dims1_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod6_head_dims0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_pow_2_headdim_head_dim_121_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_pow_2_headdim_head_dim_94_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_qkv_and_block_mask_on_the_same_device_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_aux__causal_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_aux__identity_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_aux__rel_bias_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_max__alibi_bias_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_selective_ac_ops_to_save2_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_skip_odd_keys_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_small_q_kv_len_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_backwards_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s1_v_s1_do_s0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s1_v_s1_do_s2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s2_v_s2_do_s0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s2_v_s2_do_s1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s2_v_s2_do_s1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s3_v_s3_do_s0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_tensor_subclass_dispatch_order_cuda, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_page_allocation_cuda, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod2_cuda_bfloat16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod4_cuda_float32, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_mask_vs_sequence_lengths_compile_False_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_size_changes_BLOCK_SIZE_128_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_compiling_create_block_mask_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_flex_attention_poison_mod_bwd_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_flex_attention_poisoned_rel_logits_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_from_kv_blocks_full_indices_False_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_absolute_2d_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_batch_head_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_batch_head_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_global_tokens_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_global_tokens_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_indirect_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_learnable_bias_global_compiled_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_learnable_bias_global_compiled_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_learnable_bias_global_compiled_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_learnable_bias_global_compiled_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_local_window_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_multiplicative_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_multiplicative_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_only_grad_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_only_grad_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_weird_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_weird_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda 2025-12-04T10:26:10.1691525Z 2025-12-04T10:26:10.1691722Z Finished inductor/test_flex_attention 5/6 ... [2025-12-04 10:26:10.160586][213661.419960647], took 8.52min 2025-12-04T10:26:10.1692384Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-ea2ab4f276b6917a.xml 2025-12-04T10:26:10.1741398Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:26:10.1742196Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T10:26:10.1742854Z Uploading artifacts took 0.00 seconds 2025-12-04T10:26:10.1746906Z Running inductor/test_pad_mm 1/1 ... [2025-12-04 10:26:10.174424][213661.433802207] 2025-12-04T10:26:10.1747599Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:26:10.1751086Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_pad_mm.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:26:10.174894] 2025-12-04T10:27:52.4313359Z 2025-12-04T10:27:52.4314688Z inductor/test_pad_mm 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_pad_mm_1.1_8a03e681155b7cfe_.log 2025-12-04T10:27:52.4319954Z Running 19 items in this shard: test/inductor/test_pad_mm.py::PadMMTest::test_cat_pad_mm_dyn_m, test/inductor/test_pad_mm.py::PadMMTest::test_exclude_cat_padding, test/inductor/test_pad_mm.py::PadMMTest::test_exclude_padding, test/inductor/test_pad_mm.py::PadMMTest::test_no_autocast_in_pad_bmm_joint_graph_pass, test/inductor/test_pad_mm.py::PadMMTest::test_original_aten_preserved_pad_mm, test/inductor/test_pad_mm.py::PadMMTest::test_pad_addmm_2d_bias, test/inductor/test_pad_mm.py::PadMMTest::test_pad_addmm_dyn_m, test/inductor/test_pad_mm.py::PadMMTest::test_pad_addmm_dyn_mn, test/inductor/test_pad_mm.py::PadMMTest::test_pad_batch, test/inductor/test_pad_mm.py::PadMMTest::test_pad_bmm_dyn_b, test/inductor/test_pad_mm.py::PadMMTest::test_pad_bmm_dyn_bm, test/inductor/test_pad_mm.py::PadMMTest::test_pad_bmm_dyn_k, test/inductor/test_pad_mm.py::PadMMTest::test_pad_mm_bf16, test/inductor/test_pad_mm.py::PadMMTest::test_pad_mm_dyn_k, test/inductor/test_pad_mm.py::PadMMTest::test_pad_mm_dyn_m, test/inductor/test_pad_mm.py::PadMMTest::test_pad_mm_dyn_mnk, test/inductor/test_pad_mm.py::PadMMTest::test_pad_mm_dyn_n, test/inductor/test_pad_mm.py::PadMMTest::test_pad_single_cat, test/inductor/test_pad_mm.py::PadMMTest::test_zero_dim 2025-12-04T10:27:52.4327840Z 2025-12-04T10:27:52.4328061Z Finished inductor/test_pad_mm 1/1 ... [2025-12-04 10:27:52.431276][213763.690653015], took 1.70min 2025-12-04T10:27:52.4328881Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-ea2ab4f276b6917a.xml 2025-12-04T10:27:52.4515467Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:27:52.4521362Z Running inductor/test_benchmark_fusion 1/1 ... [2025-12-04 10:27:52.451658][213763.71103588] 2025-12-04T10:27:52.4522163Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:27:52.4525906Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_benchmark_fusion.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:27:52.452147] 2025-12-04T10:28:23.7267187Z 2025-12-04T10:28:23.7271185Z inductor/test_benchmark_fusion 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_benchmark_fusion_1.1_8c6a72597c36684b_.log 2025-12-04T10:28:23.7279585Z Running 16 items in this shard: test/inductor/test_benchmark_fusion.py::BenchmarkFusionGpuTest::test_avoid_register_spilling_cuda, test/inductor/test_benchmark_fusion.py::BenchmarkFusionGpuTest::test_foreach_kernel_cuda, test/inductor/test_benchmark_fusion.py::BenchmarkFusionGpuTest::test_register_spills_cuda, test/inductor/test_benchmark_fusion.py::BenchmarkFusionGpuTest::test_resnet18_cuda, test/inductor/test_benchmark_fusion.py::BenchmarkFusionGpuTest::test_softmax_cuda, test/inductor/test_benchmark_fusion.py::BenchmarkFusionGpuTest::test_tield_kernel_fusion_cuda, test/inductor/test_benchmark_fusion.py::BenchmarkingTest::test_benchmark_on_non_zero_device, test/inductor/test_benchmark_fusion.py::BenchmarkMultiTemplateFusionGpuTest::test_changed_layout, test/inductor/test_benchmark_fusion.py::BenchmarkMultiTemplateFusionGpuTest::test_equivalent_extern_code, test/inductor/test_benchmark_fusion.py::BenchmarkMultiTemplateFusionGpuTest::test_equivalent_template_code, test/inductor/test_benchmark_fusion.py::BenchmarkFusionCpuTest::test_avoid_register_spilling_cpu, test/inductor/test_benchmark_fusion.py::BenchmarkFusionCpuTest::test_foreach_kernel_cpu, test/inductor/test_benchmark_fusion.py::BenchmarkFusionCpuTest::test_register_spills_cpu, test/inductor/test_benchmark_fusion.py::BenchmarkFusionCpuTest::test_resnet18_cpu, test/inductor/test_benchmark_fusion.py::BenchmarkFusionCpuTest::test_softmax_cpu, test/inductor/test_benchmark_fusion.py::BenchmarkFusionCpuTest::test_tield_kernel_fusion_cpu 2025-12-04T10:28:23.7284310Z 2025-12-04T10:28:23.7284561Z Finished inductor/test_benchmark_fusion 1/1 ... [2025-12-04 10:28:23.726430][213794.985805021], took 0.52min 2025-12-04T10:28:23.7285391Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-ea2ab4f276b6917a.xml 2025-12-04T10:28:23.7476282Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:28:23.7481874Z Running inductor/test_decompose_mem_bound_mm 1/1 ... [2025-12-04 10:28:23.747734][213795.007101096] 2025-12-04T10:28:23.7482743Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:28:23.7486021Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_decompose_mem_bound_mm.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:28:23.748205] 2025-12-04T10:29:05.2956676Z 2025-12-04T10:29:05.2958015Z inductor/test_decompose_mem_bound_mm 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_decompose_mem_bound_mm_1.1_f9320608d60705ac_.log 2025-12-04T10:29:05.2976167Z Running 37 items in this shard: test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_check_device, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_bmm_b_10240_m_2_k_2_n_2_should_decompose_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_bmm_b_10240_m_2_k_32_n_32_should_decompose_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_bmm_b_2000_m_2_k_2_n_2_should_decompose_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_bmm_cpu_b_1_m_2_k_2_n_2_should_decompose_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_bmm_cpu_b_2_m_2_k_2_n_2_should_decompose_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_20480_k_32_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_20480_k_32_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_20480_k_5_n_2_should_decompose_True_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_20480_k_5_n_2_should_decompose_True_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_2048_k_2_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_2048_k_2_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_20480_k_32_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_20480_k_32_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_20480_k_5_n_2_should_decompose_True_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_20480_k_5_n_2_should_decompose_True_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_2048_k_2_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_2048_k_2_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_cpu_m_1_k_64_n_16_should_decompose_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_cpu_m_1_k_64_n_32_should_decompose_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_cpu_m_2_k_64_n_16_should_decompose_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_20480_k_32_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_20480_k_32_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_20480_k_5_n_2_should_decompose_True_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_20480_k_5_n_2_should_decompose_True_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_2048_k_2_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_2048_k_2_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_20480_k_32_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_20480_k_32_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_20480_k_5_n_2_should_decompose_True_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_20480_k_5_n_2_should_decompose_True_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_2048_k_2_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_2048_k_2_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_dynamic_shape_decompose_addmm, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_dynamic_shape_m_20480_k_5_n_2_should_decompose_True_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_dynamic_shape_m_20480_k_5_n_2_should_decompose_True_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_realize_input 2025-12-04T10:29:05.2987394Z 2025-12-04T10:29:05.2987609Z Finished inductor/test_decompose_mem_bound_mm 1/1 ... [2025-12-04 10:29:05.295165][213836.554542804], took 0.69min 2025-12-04T10:29:05.2988356Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_decompose_mem_bound_mm/inductor.test_decompose_mem_bound_mm-e91b2bf8a5c2cbfd.xml 2025-12-04T10:29:05.4515640Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:29:05.4524875Z Running test_torch 1/1 ... [2025-12-04 10:29:05.452243][213836.711619532] 2025-12-04T10:29:05.4525168Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:29:05.4529421Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_torch.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:29:05.452531] 2025-12-04T10:31:58.7672131Z 2025-12-04T10:31:58.7673115Z test_torch 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_torch_1.1_2bf2fc3fc5519982_.log 2025-12-04T10:31:58.7849431Z Running 976 items in this shard: test/test_torch.py::TestBasicVitalSigns::test_basic_vitals, test/test_torch.py::TestBasicVitalSigns::test_basic_vitals_read_write, test/test_torch.py::TestBasicVitalSigns::test_dataloader_vitals, test/test_torch.py::TestTorch::test_RNGState, test/test_torch.py::TestTorch::test_RNGStateAliasing, test/test_torch.py::TestTorch::test_RNG_after_pickle, test/test_torch.py::TestTorch::test_Size, test/test_torch.py::TestTorch::test_Size_concat_non_tuple_sequence, test/test_torch.py::TestTorch::test_Size_concat_wildcard, test/test_torch.py::TestTorch::test_Size_iter, test/test_torch.py::TestTorch::test_Size_scalar, test/test_torch.py::TestTorch::test_add_meta_scalar, test/test_torch.py::TestTorch::test_allow_tensor_metadata_change, test/test_torch.py::TestTorch::test_apply, test/test_torch.py::TestTorch::test_as_subclass, test/test_torch.py::TestTorch::test_assert_async, test/test_torch.py::TestTorch::test_backward_hooks_traverse, test/test_torch.py::TestTorch::test_batch_norm_cpu_inference, test/test_torch.py::TestTorch::test_bf16_supported_on_cpu, test/test_torch.py::TestTorch::test_bmm_multithreaded, test/test_torch.py::TestTorch::test_boxMullerState, test/test_torch.py::TestTorch::test_cat_neg_dim, test/test_torch.py::TestTorch::test_check, test/test_torch.py::TestTorch::test_chunk_neg_dim, test/test_torch.py::TestTorch::test_conj_neg_tolist, test/test_torch.py::TestTorch::test_conj_physical_meta_stride, test/test_torch.py::TestTorch::test_contains, test/test_torch.py::TestTorch::test_copy_broadcast, test/test_torch.py::TestTorch::test_copy_dtypes, test/test_torch.py::TestTorch::test_copy_float16, test/test_torch.py::TestTorch::test_copy_many_to_one, test/test_torch.py::TestTorch::test_copy_transpose, test/test_torch.py::TestTorch::test_cuda_not_built, test/test_torch.py::TestTorch::test_cummax_neg_dim, test/test_torch.py::TestTorch::test_cummin_neg_dim, test/test_torch.py::TestTorch::test_cumprod_neg_dim, test/test_torch.py::TestTorch::test_cumsum_neg_dim, test/test_torch.py::TestTorch::test_cxx_flags, test/test_torch.py::TestTorch::test_data_ptr_of_empty_tensor_with_storage, test/test_torch.py::TestTorch::test_data_ptr_of_empty_view_with_storage, test/test_torch.py::TestTorch::test_deepcopy_gradient, test/test_torch.py::TestTorch::test_deepcopy_parameter, test/test_torch.py::TestTorch::test_deterministic_fill_uninitialized_memory, test/test_torch.py::TestTorch::test_deterministic_flag, test/test_torch.py::TestTorch::test_device, test/test_torch.py::TestTorch::test_dim_order, test/test_torch.py::TestTorch::test_dir, test/test_torch.py::TestTorch::test_doc, test/test_torch.py::TestTorch::test_doc_template, test/test_torch.py::TestTorch::test_dot_data_use, test/test_torch.py::TestTorch::test_dtype_is_signed, test/test_torch.py::TestTorch::test_element_size, test/test_torch.py::TestTorch::test_empty_meta, test/test_torch.py::TestTorch::test_empty_storage_view, test/test_torch.py::TestTorch::test_equal, test/test_torch.py::TestTorch::test_error_msg_type_translation, test/test_torch.py::TestTorch::test_fill_diagonal, test/test_torch.py::TestTorch::test_format_scalar_meta, test/test_torch.py::TestTorch::test_from_buffer, test/test_torch.py::TestTorch::test_from_file, test/test_torch.py::TestTorch::test_gather_neg_dim, test/test_torch.py::TestTorch::test_generator_cpu, test/test_torch.py::TestTorch::test_get_cpu_capability, test/test_torch.py::TestTorch::test_has_internal_overlap, test/test_torch.py::TestTorch::test_has_storage, test/test_torch.py::TestTorch::test_index_add, test/test_torch.py::TestTorch::test_index_add_all_dtypes, test/test_torch.py::TestTorch::test_index_add_cornercase, test/test_torch.py::TestTorch::test_index_add_correctness, test/test_torch.py::TestTorch::test_index_add_neg_dim, test/test_torch.py::TestTorch::test_index_copy_neg_dim, test/test_torch.py::TestTorch::test_index_fill_neg_dim, test/test_torch.py::TestTorch::test_index_select_neg_dim, test/test_torch.py::TestTorch::test_invalid_arg_error_handling, test/test_torch.py::TestTorch::test_invalid_generator_raises, test/test_torch.py::TestTorch::test_is_nonzero, test/test_torch.py::TestTorch::test_is_same_size, test/test_torch.py::TestTorch::test_iter, test/test_torch.py::TestTorch::test_kthvalue_neg_dim, test/test_torch.py::TestTorch::test_linspace_logspace, test/test_torch.py::TestTorch::test_logcumsumexp_neg_dim, test/test_torch.py::TestTorch::test_manual_seed, test/test_torch.py::TestTorch::test_map, test/test_torch.py::TestTorch::test_map2, test/test_torch.py::TestTorch::test_max_neg_dim, test/test_torch.py::TestTorch::test_mean_neg_dim, test/test_torch.py::TestTorch::test_median_neg_dim, test/test_torch.py::TestTorch::test_memory_format, test/test_torch.py::TestTorch::test_memory_format_contiguous_returns_same_tensor_if_already_satisfies, test/test_torch.py::TestTorch::test_memory_format_empty, test/test_torch.py::TestTorch::test_min_neg_dim, test/test_torch.py::TestTorch::test_mode_neg_dim, test/test_torch.py::TestTorch::test_multinomial_invalid_probs, test/test_torch.py::TestTorch::test_nanmedian_neg_dim, test/test_torch.py::TestTorch::test_narrow_neg_dim, test/test_torch.py::TestTorch::test_nbytes, test/test_torch.py::TestTorch::test_ndim, test/test_torch.py::TestTorch::test_new, test/test_torch.py::TestTorch::test_newaxis_numpy_comparison, test/test_torch.py::TestTorch::test_newindex, test/test_torch.py::TestTorch::test_no_cuda_monkeypatch, test/test_torch.py::TestTorch::test_norm_neg_dim, test/test_torch.py::TestTorch::test_normal_shape, test/test_torch.py::TestTorch::test_numel, test/test_torch.py::TestTorch::test_parallel_info, test/test_torch.py::TestTorch::test_parsing_double, test/test_torch.py::TestTorch::test_parsing_int64, test/test_torch.py::TestTorch::test_parsing_intlist, test/test_torch.py::TestTorch::test_permute, test/test_torch.py::TestTorch::test_pickle, test/test_torch.py::TestTorch::test_pickle_dtype, test/test_torch.py::TestTorch::test_pickle_function, test/test_torch.py::TestTorch::test_pickle_generator, test/test_torch.py::TestTorch::test_pickle_parameter, test/test_torch.py::TestTorch::test_pickle_parameter_no_requires_grad, test/test_torch.py::TestTorch::test_pickle_size, test/test_torch.py::TestTorch::test_pin_memory, test/test_torch.py::TestTorch::test_print, test/test_torch.py::TestTorch::test_prod_neg_dim, test/test_torch.py::TestTorch::test_pyobj_preserved, test/test_torch.py::TestTorch::test_qengine, test/test_torch.py::TestTorch::test_renorm_neg_dim, test/test_torch.py::TestTorch::test_resizable, test/test_torch.py::TestTorch::test_reversed, test/test_torch.py::TestTorch::test_scatter_neg_dim, test/test_torch.py::TestTorch::test_select_neg_dim, test/test_torch.py::TestTorch::test_set_flush_denormal, test/test_torch.py::TestTorch::test_setting_real_imag_to_a_number, test/test_torch.py::TestTorch::test_show_config, test/test_torch.py::TestTorch::test_size_neg_dim, test/test_torch.py::TestTorch::test_size_stride, test/test_torch.py::TestTorch::test_sizeof, test/test_torch.py::TestTorch::test_slice, test/test_torch.py::TestTorch::test_slow_test, test/test_torch.py::TestTorch::test_sobolengine_bounds, test/test_torch.py::TestTorch::test_sobolengine_bounds_scrambled, test/test_torch.py::TestTorch::test_sobolengine_continuing, test/test_torch.py::TestTorch::test_sobolengine_continuing_scrambled, test/test_torch.py::TestTorch::test_sobolengine_default_dtype, test/test_torch.py::TestTorch::test_sobolengine_distribution, test/test_torch.py::TestTorch::test_sobolengine_distribution_scrambled, test/test_torch.py::TestTorch::test_sobolengine_draw, test/test_torch.py::TestTorch::test_sobolengine_draw_base2, test/test_torch.py::TestTorch::test_sobolengine_draw_base2_scrambled, test/test_torch.py::TestTorch::test_sobolengine_draw_scrambled, test/test_torch.py::TestTorch::test_sobolengine_fast_forward, test/test_torch.py::TestTorch::test_sobolengine_fast_forward_scrambled, test/test_torch.py::TestTorch::test_sobolengine_first_point, test/test_torch.py::TestTorch::test_sobolengine_high_dim, test/test_torch.py::TestTorch::test_sobolengine_raise, test/test_torch.py::TestTorch::test_sobolengine_reset, test/test_torch.py::TestTorch::test_sobolengine_reset_scrambled, test/test_torch.py::TestTorch::test_sort_neg_dim, test/test_torch.py::TestTorch::test_split_neg_dim, test/test_torch.py::TestTorch::test_split_with_sizes_copy_out, test/test_torch.py::TestTorch::test_squeeze_neg_dim, test/test_torch.py::TestTorch::test_std_neg_dim, test/test_torch.py::TestTorch::test_storage_base_init, test/test_torch.py::TestTorch::test_storage_base_new, test/test_torch.py::TestTorch::test_storage_byteswap, test/test_torch.py::TestTorch::test_storage_casts, test/test_torch.py::TestTorch::test_storage_cycle_via_dict, test/test_torch.py::TestTorch::test_storage_cycle_via_slots, test/test_torch.py::TestTorch::test_storage_dead_weak_ref, test/test_torch.py::TestTorch::test_storage_dealloc, test/test_torch.py::TestTorch::test_storage_dealloc_resurrected, test/test_torch.py::TestTorch::test_storage_dealloc_subclass_resurrected, test/test_torch.py::TestTorch::test_storage_dealloc_subclass_zombie, test/test_torch.py::TestTorch::test_storage_dict_dealloc, test/test_torch.py::TestTorch::test_storage_error, test/test_torch.py::TestTorch::test_storage_error_no_attribute, test/test_torch.py::TestTorch::test_storage_finalizer_dealloc, test/test_torch.py::TestTorch::test_storage_fix_weakref_no_leak, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc_resurrected, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc_zombie, test/test_torch.py::TestTorch::test_storage_preserve_nonhermetic_in_hermetic_context, test/test_torch.py::TestTorch::test_storage_resurrected_weak_ref, test/test_torch.py::TestTorch::test_storage_slot_dealloc, test/test_torch.py::TestTorch::test_storage_thread_safety, test/test_torch.py::TestTorch::test_storage_weakref_dealloc, test/test_torch.py::TestTorch::test_structseq_repr, test/test_torch.py::TestTorch::test_subclass_preserved, test/test_torch.py::TestTorch::test_subclass_tensors, test/test_torch.py::TestTorch::test_sum_neg_dim, test/test_torch.py::TestTorch::test_swap_basic, test/test_torch.py::TestTorch::test_swap_fail_slots, test/test_torch.py::TestTorch::test_t_not_2d_error, test/test_torch.py::TestTorch::test_tensor_base_init, test/test_torch.py::TestTorch::test_tensor_base_new, test/test_torch.py::TestTorch::test_tensor_ctor_scalar, test/test_torch.py::TestTorch::test_tensor_cycle_via_dict, test/test_torch.py::TestTorch::test_tensor_cycle_via_slots, test/test_torch.py::TestTorch::test_tensor_dead_weak_ref, test/test_torch.py::TestTorch::test_tensor_dict_dealloc, test/test_torch.py::TestTorch::test_tensor_finalizer_dealloc, test/test_torch.py::TestTorch::test_tensor_fix_weakref_no_leak, test/test_torch.py::TestTorch::test_tensor_item_no_warning, test/test_torch.py::TestTorch::test_tensor_ressurecting_clear, test/test_torch.py::TestTorch::test_tensor_resurrected_weak_ref, test/test_torch.py::TestTorch::test_tensor_set, test/test_torch.py::TestTorch::test_tensor_set_errors, test/test_torch.py::TestTorch::test_tensor_slot_dealloc, test/test_torch.py::TestTorch::test_tensor_weakref_dealloc, test/test_torch.py::TestTorch::test_tensor_where_scalar, test/test_torch.py::TestTorch::test_tensor_with_grad_to_scalar_warning, test/test_torch.py::TestTorch::test_tensoriterator_output_setup, test/test_torch.py::TestTorch::test_terminate_handler_on_crash, test/test_torch.py::TestTorch::test_to, test/test_torch.py::TestTorch::test_to_with_tensor, test/test_torch.py::TestTorch::test_topk_neg_dim, test/test_torch.py::TestTorch::test_torch_from_file, test/test_torch.py::TestTorch::test_transpose_neg_dim, test/test_torch.py::TestTorch::test_type, test/test_torch.py::TestTorch::test_type_alias, test/test_torch.py::TestTorch::test_type_conversion_via_dtype_name, test/test_torch.py::TestTorch::test_typed_storage_deprecation_warning, test/test_torch.py::TestTorch::test_typed_storage_internal_no_warning, test/test_torch.py::TestTorch::test_unbind_neg_dim, test/test_torch.py::TestTorch::test_unflatten, test/test_torch.py::TestTorch::test_unfold_neg_dim, test/test_torch.py::TestTorch::test_unsqueeze_neg_dim, test/test_torch.py::TestTorch::test_upsample_nearest1d_meta, test/test_torch.py::TestTorch::test_upsample_nearest2d_meta, test/test_torch.py::TestTorch::test_var_neg_dim, test/test_torch.py::TestTorch::test_warn_types, test/test_torch.py::TestTorch::test_wildcard_import, test/test_torch.py::TestVitalSignsCudaCUDA::test_cuda_vitals_gpu_only_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test__local_scalar_dense_with_empty_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_errors_with_cpu_scalars_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_assertRaisesRegex_ignore_msg_non_native_device_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bfloat16_neg_abs_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bool_tensor_value_change_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_add_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_addcdiv_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_addcmul_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_atan2_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_copy_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_dist_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_div_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_eq_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_fmod_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_ge_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_gt_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_le_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_lerp_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_lt_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_map2_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_map_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_fill_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_scatter_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_select_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_max_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_min_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_mul_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_ne_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_pow_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_remainder_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_sub_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_no_inf_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_no_inf_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_cuda_backward_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_euclidean_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_grad_p_lt_1_no_nan_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_large_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_non_contiguous_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_non_contiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_norm_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_norm_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_same_inputs_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_check_tensor_all_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_check_tensor_internal_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_all_dtypes_and_devices_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_not_memory_dense_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_zero_stride_dim_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_complex_half_experimental_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_constants_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_conv_transposed_backward_agnostic_to_memory_format_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_conv_transposed_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_all_dtypes_and_devices_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_math_view_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_mem_overlap_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cov_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cpp_warnings_have_python_context_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummax_cummin_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummax_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummin_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumprod_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumsum_64bit_indexing_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumsum_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumsum_outer_dim_64bit_indexing_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_cumsum_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_interpolate_bilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_replication_pad2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_device_guard_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_dim_function_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_discontiguous_out_cumsum_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_dist_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_dtypetensor_warnings_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_expected_failure_xla_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_no_zero_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_no_zero_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gather_backward_deterministic_path_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gather_backward_one_dim_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scale_will_not_overflow_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaler_deprecated_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaler_pass_itself_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_accumulation_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach0_fused0_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach0_fused0_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach0_fused0_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach2_fused_True_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach2_fused_True_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach2_fused_True_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach_True_fused1_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach_True_fused1_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach_True_fused1_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_clipping_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_clipping_separate_unscale_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_multiple_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_penalty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_state_dict_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_sparse_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_update_scale_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_type_promotion_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_hook_remove_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_add_large_inputs_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_add_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_put_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_int64_upsample3d_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_invalid_shapes_grid_sampler_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_is_set_to_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_is_signed_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e4m3fn, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e4m3fnuz, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e5m2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e5m2fnuz, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_large_cumprod_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_large_cumsum_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_logcumsumexp_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lognormal_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_bool_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bfloat16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bfloat16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bool_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bool_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex128_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex128_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float32_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float32_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int32_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int32_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int8_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int8_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_uint8_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_uint8_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_bool_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_inplace_noncontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_large_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_clone_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_consistency_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_cpu_and_cuda_ops_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_empty_like_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_factory_like_functions_preserve_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_operators_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_preserved_after_permute_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_propagation_rules_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_to_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_type_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_type_shortcuts_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_module_share_memory_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_device_constrain_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_empty_w_replacement_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_empty_wo_replacement_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_gpu_device_constrain_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_rng_state_advance_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_narrow_copy_non_contiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_narrow_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_no_nondeterministic_alert_interpolate_bilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_no_nondeterministic_alert_interpolate_trilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveAvgPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveAvgPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveMaxPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AvgPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_CTCLoss_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_EmbeddingBag_max_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_FractionalMaxPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_FractionalMaxPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_NLLLoss_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReflectionPad1d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReflectionPad3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad1d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_bincount_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_grid_sample_2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_grid_sample_3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_histc_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_histc_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_bicubic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_bilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_linear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_trilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_median_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_put_accumulate_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_put_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_qint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_qint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint2x4, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint4x2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nullary_op_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pairwise_distance_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_and_graph_partition_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_and_graph_partition_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_and_graph_partition_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_between_unscale_and_step_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_between_unscale_and_step_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_between_unscale_and_step_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pdist_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pdist_norm_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pickle_gradscaler_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pin_memory_from_constructor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_reduced_type_float_copy_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_reduced_type_float_copy_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_repeat_interleave_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scalar_check_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_bool_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_non_unique_index_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_one_dim_deterministic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_to_large_input_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_bool_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_multiply_unsupported_dtypes_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_multiply_unsupported_dtypes_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_to_large_input_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_zero_size_index_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_serialization_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_default_tensor_type_warnings_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_shift_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_skip_xla_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_all_devices_non_blocking_False_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_all_devices_non_blocking_True_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_qint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_qint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_quint4x2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_quint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_use_count_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_strides_propagation_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_sync_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_set_errors_multigpu_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_shape_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_type_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_ternary_op_mem_overlap_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_untyped_storage_meta_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_warn_always_caught_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_where_scalar_handcrafted_values_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_advancedindex_mixed_cpu_devices_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_advancedindex_mixed_devices_error_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_float32, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_float64, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_int64, test/test_torch.py::TestDevicePrecisionCUDA::test_copy_broadcast_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_copy_noncontig_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_cuda_device_idx_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_device_serialization_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float16, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float32, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float64, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int16, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int32, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int64, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int8, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_uint8, test/test_torch.py::TestDevicePrecisionCUDA::test_index_add_bfloat16_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_multidevice_serialization_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_type_conversions_same_device_cuda 2025-12-04T10:31:58.8013823Z 2025-12-04T10:31:58.8013981Z Finished test_torch 1/1 ... [2025-12-04 10:31:58.767880][214010.027258752], took 2.89min 2025-12-04T10:31:58.8014642Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_decompose_mem_bound_mm/inductor.test_decompose_mem_bound_mm-e91b2bf8a5c2cbfd.xml 2025-12-04T10:31:58.8991365Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:31:58.9018305Z Running test_ops 2/7 ... [2025-12-04 10:31:58.901214][214010.160590817] 2025-12-04T10:31:58.9018923Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:31:58.9020341Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '--shard-id=2', '--num-shards=7', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:31:58.901494] 2025-12-04T10:42:29.7250389Z 2025-12-04T10:42:29.7253072Z test_ops 2/7 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_2.7_75a01f0f4b3a0475_.log 2025-12-04T10:42:29.8196363Z Running 4808 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nonzero_static_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randn_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_acos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_atleast_3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_permuted_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_eq_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_index_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_mT_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nanmean_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_permute_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_randn_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_repeat_interleave_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_rsqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sgn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sigmoid_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_transpose_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unfold_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rand___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__batch_norm_with_update_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_T_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_abs_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_alias_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atleast_3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diagonal_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_erfinv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_i0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_elu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_smooth_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_entr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_erfcx_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_logit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_spherical_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_take_along_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unsqueeze_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__softmax_backward_data_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__unsafe_masked_index_put_accumulate_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_all_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_argwhere_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atleast_2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bincount_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cartesian_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_clone_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_conj_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_contiguous_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cov_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diff_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_double_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_exp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_expm1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isnan_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_istft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_binary_return_by_ref_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_kthvalue_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_ldl_factor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_factor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_factor_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_multi_dot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logcumsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mT_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_median_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_var_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_matrix_exp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_binary_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_min_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_msort_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nansum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ne_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv_transpose1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_cross_entropy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_dropout2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_glu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hardshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_area_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_linear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_kl_div_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_mish_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multi_head_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multilabel_soft_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_one_hot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_replicate_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_triplet_margin_with_distance_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randn_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_repeat_interleave_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_round_decimals_0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_amax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_hann_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sinc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_i0e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_modified_bessel_i0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_ndtri_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_scaled_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_transpose_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trapz_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unflatten_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_errors_T_cuda, test/test_ops.py::TestCommonCUDA::test_errors___rmul___cuda, test/test_ops.py::TestCommonCUDA::test_errors___rxor___cuda, test/test_ops.py::TestCommonCUDA::test_errors_add_cuda, test/test_ops.py::TestCommonCUDA::test_errors_amin_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_errors_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_errors_eye_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_errors_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_ldexp_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linalg_lstsq_cuda, test/test_ops.py::TestCommonCUDA::test_errors_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_errors_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_rms_norm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_general_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_kaiser_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout1_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout0_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout0_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_errors_trace_cuda, test/test_ops.py::TestCommonCUDA::test_errors_tril_cuda, test/test_ops.py::TestCommonCUDA::test_errors_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmod___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__chunk_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diff_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_histc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_kron_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mH_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randn_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scalar_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_airy_ai_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_h_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_laguerre_polynomial_l_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_H_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values__chunk_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_acosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_angle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_3d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_byte_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_combinations_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diag_embed_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_dsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_rfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ldexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_reduction_with_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nansum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nonzero_static_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_put_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_reshape_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_resize__cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_airy_ai_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_i0e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_k1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_trace_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_transpose_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_view_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_zero__cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rxor___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__chunk_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_aminmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_2d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cov_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geqrf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gradient_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_grid_sampler_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_2inputs_2outputs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kthvalue_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ldexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cond_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_slogdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vander_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logaddexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mH_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_fro_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_inf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_airy_ai_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_legendre_polynomial_p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_numpy_ref_aminmax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_argwhere_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vander_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_ravel_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_searchsorted_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_general_cosine_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_nuttall_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_view_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_out___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___ror___cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_combinations_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_corrcoef_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_geqrf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmm_decomposed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_as_strided_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_baddbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_det_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nansum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_outer_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning__native_batch_norm_legit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_byte_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_half_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_long_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_short_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_acos_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_alias_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_amax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_amin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_asin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_block_diag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cosh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_exp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ge_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_imag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_matrix_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rad2deg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ravel_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_repeat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_round_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rsqrt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_bessel_j1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_std_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_triu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unbind_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unfold_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addcdiv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_amin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_arange_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atleast_3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_baddbmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bernoulli_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bincount_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_byte_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_conj_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_corrcoef_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cov_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diagflat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diagonal_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_double_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_eq_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_expand_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_floor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_histogramdd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_i0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_imag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_vander_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log1p_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_long_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lu_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_min_binary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nan_to_num_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_binary_cross_entropy_with_logits_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_ctc_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_nearest-exact_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_rrelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pca_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pinverse_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_positive_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_qr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_repeat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_decimals_0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_decimals_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scalar_tensor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_general_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signbit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_airy_ai_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_laguerre_polynomial_l_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_i1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_scaled_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_split_with_sizes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_transpose_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_triu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unflatten_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_uniform_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_zero__cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float8_e5m2, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float8_e5m2fnuz, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nextafter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_T_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_amax_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_cat_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_eq_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_eye_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_t_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_view_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_where_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_polar_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_shapes_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_imag_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_native_layer_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_indices_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_polar_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_celu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_number_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_polar_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_right_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float8_e5m2, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_amin_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_any_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_min_binary_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_var_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_var_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___radd___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rdiv___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_decomposed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_any_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_corrcoef_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagflat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diff_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_einsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_slogdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vander_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_channel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_reflect_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softsign_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_static_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_fro_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pinverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rand_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unbind_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsafe_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clamp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_conj_physical_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_inv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_movedim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanquantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_square_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_true_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_asin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atleast_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_byte_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_einsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_geqrf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_gt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isnan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_normal_number_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signbit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_zeta_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_t_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_asin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_clamp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isneginf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logical_and_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_normal_number_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ones_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_permute_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resolve_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_searchsorted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_hann_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_transpose_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_uniform_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_any_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_byte_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_gather_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isposinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_xor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_reshape_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signbit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_true_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_var_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_as_strided_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_broadcast_shapes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_byte_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cauchy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clamp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_grid_sampler_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isnan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_permute_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randn_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_resize_as__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_hann_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signbit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_uniform_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unique_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vstack_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rsub___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addcmul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_allclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_conj_physical_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_expm1_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_float_power_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_hsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_select_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linspace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_mul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_narrow_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_permute_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_reciprocal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_reshape_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_rsqrt_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_rsub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sigmoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sqrt_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_triu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unbind_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unflatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_view_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_where_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_bfloat16_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_block_diag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_bool_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_char_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_combinations_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_corrcoef_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cosh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cov_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_expm1_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_irfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_irfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_gather_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_put_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cholesky_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lstsq_grad_oriented_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_power_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vander_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mH_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_movedim_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_rms_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_fro_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_pinverse_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_repeat_interleave_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_reshape_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resize__cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_scatter_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sigmoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_split_with_sizes_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_squeeze_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_squeeze_multiple_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_trace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unfold_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_vsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rmatmul___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rmul___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rsub___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_bfloat16_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_float_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_half_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_contiguous_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_dsplit_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_dstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isinf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_istft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log1p_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_or_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_channel_shuffle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_normal__in_place_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_reshape_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sigmoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sqrt_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_t_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_take_along_dim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_tan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_triu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_add_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addcdiv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_all_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_bmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_bool_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_byte_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_count_nonzero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diag_embed_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fliplr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isreal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ldexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_diagonal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_factor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_factor_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_factor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_factor_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_rank_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_slogdet_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_tensorsolve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_vander_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mH_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_sum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv1d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_linear_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_rms_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_unfold_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_permute_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ravel_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_reshape_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rsub_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sgn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sinh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_square_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_stft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_svd_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_take_along_dim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_trapezoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tril_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unflatten_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unfold_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_as_real_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view_T_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__native_batch_norm_legit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_byte_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_long_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_acos_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_alias_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_arange_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atleast_3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_broadcast_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_column_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_contiguous_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cos_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_div_floor_rounding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_erf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_erfinv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fliplr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_flipud_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ge_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_hypot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isreal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_lgamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log10_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log1p_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log_normal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nan_to_num_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_narrow_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_native_layer_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ne_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_full_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_channel_shuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_prelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_normal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rad2deg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_real_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_reshape_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rot90_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sinc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_i1e_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_ndtri_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_tril_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unbind_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unflatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_view_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_xlogy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__unsafe_masked_index_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_abs_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_acos_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addcdiv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_as_strided_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_baddbmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_contiguous_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cos_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cross_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cumulative_trapezoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_deg2rad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dist_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_div_trunc_rounding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_erfc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_erfinv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_exponential_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_flatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_frac_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_grid_sampler_2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_grid_sampler_3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_unary_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_inv_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_rank_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_triangular_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_vector_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log10_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logdet_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_long_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mH_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_argmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_logsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_matmul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_max_reduction_no_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_min_reduction_no_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mode_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_native_batch_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ne_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_zeros_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_avg_pool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_avg_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_dropout_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardswish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_linear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_local_response_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_mish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_reflect_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_replicate_negative_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pdist_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_silu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_soft_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_threshold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_norm_fro_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_normal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ormqr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_pca_lowrank_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_pinverse_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resize_as__cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_short_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_hann_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_softmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_chebyshev_polynomial_t_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_chebyshev_polynomial_u_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_chebyshev_polynomial_v_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_i0e_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_i1e_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sqrt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_squeeze_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_svd_lowrank_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_triu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unflatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unfold_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unfold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unsafe_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_var_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_zeros_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___rmod___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rand___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argsort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_baddbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_and_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_left_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cholesky_inverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_floor_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_unary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lcm_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_tensorinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_j1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch__scaled_mm_cuda_float8_e4m3fn, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unique_consecutive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_not_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_right_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bool_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_broadcast_shapes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_char_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_rfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_flip_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_selu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_transpose_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unbind_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unflatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_as_strided_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_selu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_repeat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_eye_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_gather_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ge_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_geqrf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_grid_sampler_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_heaviside_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_imag_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isnan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isneginf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_not_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_zeta_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_t_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmod___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_alias_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bincount_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bucketize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_eye_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ge_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_geometric_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_geqrf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hash_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isfinite_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isnan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_istft_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_item_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_le_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_not_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_xor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nextafter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rand_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_triu_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_complex32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int32, test/test_ops.py::TestTagsCUDA::test_tags___getitem___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___rdiv___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___ror___cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_T_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_double_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_float_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_all_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_as_strided_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_or_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_block_diag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_dstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_empty_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_geometric_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_hstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_hypot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_select_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isneginf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_lcm_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linspace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log10_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logaddexp2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logspace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_permute_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_pow_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_remainder_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_ndtri_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_take_along_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tril_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unfold_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unsqueeze_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_view_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_where_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_acos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_all_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argsort_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argwhere_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_xor_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_clamp_max_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_column_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_contiguous_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diagflat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_empty_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_erfinv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expand_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_exponential_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_flip_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_frexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_reduce_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_item_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lgamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cond_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_det_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_diagonal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_eigh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_qr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_vecdot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_cumsum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_std_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_var_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_movedim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_multinomial_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nan_to_num_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_elu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pdist_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softsign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_normal_in_place_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_normal_number_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_pinverse_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randn_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resize_as__cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_rsub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_cosine_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_slice_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_airy_ai_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_squeeze_multiple_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_to_sparse_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_ops.py::TestTagsCUDA::test_tags_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestTagsCUDA::test_tags_true_divide_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_view_copy_cuda_float32, test/test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_div_floor_rounding_cuda_float32 2025-12-04T10:42:29.9076312Z 2025-12-04T10:42:29.9076461Z Finished test_ops 2/7 ... [2025-12-04 10:42:29.729023][214640.988402629], took 10.51min 2025-12-04T10:42:29.9077084Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_decompose_mem_bound_mm/inductor.test_decompose_mem_bound_mm-e91b2bf8a5c2cbfd.xml 2025-12-04T10:42:29.9077689Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:42:29.9077992Z Running test_ops 4/7 ... [2025-12-04 10:42:29.867484][214641.126861011] 2025-12-04T10:42:29.9078241Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:42:29.9078970Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '--shard-id=4', '--num-shards=7', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:42:29.867773] 2025-12-04T10:50:36.0010161Z 2025-12-04T10:50:36.0011115Z test_ops 4/7 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_4.7_e100005261f96076_.log 2025-12-04T10:50:36.0932723Z Running 4736 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_combinations_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_ops.py::TestCommonCUDA::test_compare_cpu_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing__chunk_cat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_abs_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_angle_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_partial_views_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cfloat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_index_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_lerp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_log_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nonzero_static_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_pow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_rand_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_slice_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_true_divide_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsqueeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_vsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rxor___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__chunk_cat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_polar_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_all_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_allclose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_asinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_conj_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cos_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_expand_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_eye_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_flatten_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ge_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isposinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_istft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_relu6_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_tanhshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_normal__in_place_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_select_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_multigammaln_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_square_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_std_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unflatten_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_acos_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_arange_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_constant_pad_nd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_corrcoef_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cross_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_gt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_imag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_reduce_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_inner_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eigvals_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_inv_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_pinv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_tensorsolve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log10_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_argmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_median_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mode_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nanmedian_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_native_dropout_backward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_celu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_embedding_bag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_feature_alpha_dropout_without_train_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_fractional_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_gaussian_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_bicubic_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_nearest_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_trilinear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multi_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_circular_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_reflect_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pdist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pixel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_silu_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_nuc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_outer_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_resize__cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sqrt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_squeeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_squeeze_multiple_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_svd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tensordot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_to_sparse_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch__scaled_mm_v2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_transpose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_zero__cuda, test/test_ops.py::TestCommonCUDA::test_errors___rpow___cuda, test/test_ops.py::TestCommonCUDA::test_errors__chunk_cat_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_errors_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_errors_complex_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diag_embed_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_errors_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_errors_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logcumsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_errors_median_cuda, test/test_ops.py::TestCommonCUDA::test_errors_min_binary_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_errors_scatter_add_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_general_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout4_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout1_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_airy_ai_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_2d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumulative_trapezoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_binary_return_by_ref_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_var_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_reduction_no_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_channel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cosine_embedding_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_4_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_legendre_polynomial_p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_sparse_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_consecutive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___getitem___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___rmul___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_alias_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cfloat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_conj_physical_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diff_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_hfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_hfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_gather_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isfinite_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_max_binary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_max_reduction_with_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_msort_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nan_to_num_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ones_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_outer_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_select_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sort_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_erfcx_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_with_sizes_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tril_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unfold_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_vstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___getitem___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rsub___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_decomposed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_alias_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_combinations_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_combinations_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagflat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_einsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_permuted_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gradient_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hash_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_return_by_ref_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_unpack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mH_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_msort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmedian_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_rms_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softsign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_put_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize__cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scalar_tensor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scalar_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_list_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensordot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tile_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_sparse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_uniform_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_item_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose1d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose3d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_searchsorted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_cosine_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_exponential_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_normal__in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_inv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lstsq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_slogdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanmean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_linear_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_nuc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_transpose_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning_H_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___getitem___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_float_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_polar_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_acosh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_any_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_diagonal_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_svdvals_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_empty_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_pdist_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_relu6_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_permute_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_pow_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_real_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_multigammaln_mvlgamma_p_1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_spherical_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_trace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__softmax_backward_data_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_argwhere_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_as_strided_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cartesian_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cummax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_equal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_exp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_flatten_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_full_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_gather_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ge_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_half_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_reduce_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_inner_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_4inputs_with_extra_args_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_factor_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_slogdet_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mT_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_softmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_std_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_matmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_min_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nanquantile_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_nearest_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_kl_div_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool1d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pixel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_rms_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nonzero_static_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_inf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_nuc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_rad2deg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randint_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_hann_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_entr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_hermite_polynomial_he_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_i1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_i0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_polygamma_special_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_svd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_svd_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unique_consecutive_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unique_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unravel_index_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_zero__cuda_float32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float8_e4m3fn, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs__conversions_complex_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_amin_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_lt_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_istft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lcm_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_group_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pdist_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_indices_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igammac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lcm_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lcm_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float8_e5m2fnuz, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frac_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igammac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_glu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_glu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_huber_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_all_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_aminmax_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_max_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_min_reduction_no_dim_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_min_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_sum_cuda, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___getitem___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmul___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bfloat16_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_inverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumulative_trapezoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_embed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_permuted_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_kron_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logaddexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nansum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_linear_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_inf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_permute_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_svd_lowrank_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapz_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsafe_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_where_cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addmm_decomposed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_alias_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_normal_number_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rad2deg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_round_decimals_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rsub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_transpose_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_alias_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_eq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_float_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_frac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_grid_sampler_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_igamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_vecdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_hann_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_true_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_where_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_einsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_flip_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_frac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isfinite_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eigvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mT_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randint_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_round_decimals_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unbind_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_kron_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_matrix_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_normal_number_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scalar_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_hann_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_squeeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_any_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_equal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_expand_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hash_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_kron_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_narrow_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ones_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rand_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_reshape_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scalar_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_select_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_where_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view_T_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rmul___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rpow___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_cfloat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_char_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_half_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_int_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_alias_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_any_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_chunk_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_exp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_fft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_fftshift_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isfinite_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isinf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_movedim_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_permute_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_repeat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_rot90_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_std_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_transpose_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unbind_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_view_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_zeros_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addbmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addcdiv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_argwhere_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_asin_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_atan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_atleast_2d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cdouble_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_clone_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_conj_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_dsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_dstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_eq_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_exp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_hfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fliplr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_int_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_2inputs_2outputs_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lerp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lstsq_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_pinv_singular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_slogdet_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_svd_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vecdot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logical_not_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logspace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logsumexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_long_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lu_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_cumsum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_sum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_matrix_exp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ne_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_empty_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv2d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv3d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_circular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_permute_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_repeat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_rsqrt_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sinc_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_slice_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sparse_sampled_addmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_split_with_sizes_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_mean_unbiased_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_stft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_trapezoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tril_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unflatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_view_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___radd___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rpow___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__chunk_cat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_acos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_exp2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_expand_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_expand_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fliplr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_flipud_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_hstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_item_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_svd_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linspace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_and_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_mul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_narrow_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_roll_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sinh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_transpose_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unbind_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_var_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_view_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_view_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_allclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_angle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_bfloat16_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_broadcast_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cholesky_inverse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_conj_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_corrcoef_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cross_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_hfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_irfft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_flipud_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_float_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_gather_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_half_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_put_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_inner_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lerp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cholesky_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_inv_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lstsq_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_power_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_solve_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logcumsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_not_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_cumsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_normalize_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_movedim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_narrow_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_ones_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv3d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nonzero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nonzero_static_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_fro_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_nuc_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ormqr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_pinverse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rand_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_real_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_repeat_interleave_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_mean_unbiased_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_svd_lowrank_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_to_sparse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_transpose_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unbind_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_where_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___radd___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_bfloat16_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_bool_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_cdouble_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_char_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_as_strided_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_copysign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_dstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expand_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expm1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ihfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_rfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_flatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_gt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_hstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isnan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_xor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_maximum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_minimum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_movedim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_mish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_pdist_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_positive_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_pow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_randn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_roll_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sinh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_entr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sum_to_size_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_tensor_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_transpose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unbind_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_all_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_as_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bfloat16_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cartesian_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cauchy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ceil_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_chalf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_char_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_column_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diag_embed_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diagonal_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_double_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_expand_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_hfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_rfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_float_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_float_power_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_gather_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_gt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hypot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_int_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isneginf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_item_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ldexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_factor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_factor_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_svdvals_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log1p_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_normal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logaddexp2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logcumsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_median_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_minimum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_bilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_cross_entropy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_embedding_bag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_embedding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_nearest_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_kl_div_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_pool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_prelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_unfold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_upsample_nearest_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_positive_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_qr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_randint_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_remainder_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_renorm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_repeat_interleave_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_roll_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rot90_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_round_decimals_0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_round_decimals_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_select_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_bartlett_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_cosine_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_general_cosine_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sparse_mm_reduce_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_legendre_polynomial_p_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_ndtri_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_xlog1py_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_zeta_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_triangular_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_true_divide_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_trunc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unique_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_vsplit_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_not_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_byte_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_full_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ge_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_geometric_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_hash_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_int_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isfinite_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isreal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_item_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nanmean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_normal_in_place_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_outer_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signbit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_airy_ai_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_y1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_transpose_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_triu_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unique_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_and_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logaddexp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_square_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_t_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__unsafe_masked_index_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_expand_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_tensorinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pca_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_permute_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_transpose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_triu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_xlogy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_hash_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_histc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isreal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_le_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nextafter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_repeat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unflatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_uniform_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unravel_index_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_all_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bool_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_byte_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_equal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expand_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_irfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_histc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lcm_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nansum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_new_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nonzero_static_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_hann_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_torch__scaled_mm_cuda_float8_e4m3fn, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_transpose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unravel_index_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int8, test/test_ops.py::TestTagsCUDA::test_tags___radd___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_acosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atan2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atleast_1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bucketize_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diag_embed_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_erfc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expand_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expand_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_flatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_flip_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_gcd_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_heaviside_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_igamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isnan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isposinf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log1p_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logical_and_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logsumexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nan_to_num_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_narrow_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_normal__in_place_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_reciprocal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_roll_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sgn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_erfcx_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_squeeze_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_stft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_triu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unbind_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_vdot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_zeros_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addcdiv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_amax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_asinh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bfloat16_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bool_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_broadcast_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_chunk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_corrcoef_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cov_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cummax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_deg2rad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diff_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_div_trunc_rounding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_einsum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_eq_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_exp2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expand_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_eye_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_fftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_hfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ihfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_rfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_flipud_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_floor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_imag_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags_isnan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_jiterator_binary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_kron_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_eig_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_eigvals_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lu_factor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_pinv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_svd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_svdvals_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_softmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logsumexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_long_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_sum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mode_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ne_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_dropout_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nonzero_static_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_normal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randint_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_blackman_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_exponential_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_entr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_i1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_i1e_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_xlog1py_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_squeeze_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_svd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_topk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_transpose_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unfold_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_var_mean_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_where_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_zeros_like_cuda_float32, test/test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_div_no_rounding_mode_cuda_float32 2025-12-04T10:50:36.1792162Z 2025-12-04T10:50:36.1792304Z Finished test_ops 4/7 ... [2025-12-04 10:50:36.005358][215127.264737047], took 8.10min 2025-12-04T10:50:36.1792925Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_decompose_mem_bound_mm/inductor.test_decompose_mem_bound_mm-e91b2bf8a5c2cbfd.xml 2025-12-04T10:50:36.1793521Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:50:36.1793856Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T10:50:36.1794128Z Uploading artifacts took 0.00 seconds 2025-12-04T10:50:36.1794358Z Running test_ops 6/7 ... [2025-12-04 10:50:36.143996][215127.403372458] 2025-12-04T10:50:36.1794596Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:50:36.1795313Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '--shard-id=6', '--num-shards=7', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:50:36.144286] 2025-12-04T10:59:20.9493106Z 2025-12-04T10:59:20.9494208Z test_ops 6/7 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_6.7_5b030732ed425e43_.log 2025-12-04T10:59:21.0677166Z Running 4927 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___getitem___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scalar_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing___getitem___cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_dstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_long_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_prod_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_split_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsafe_split_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_half_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_addr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_block_diag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_clamp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_clone_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cosh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_deg2rad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_digamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_dot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_exp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_expm1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_floor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_matrix_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_mul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_permute_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_i0e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtri_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_multiple_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_stft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_triu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unsqueeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_where_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__segment_reduce_lengths_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__unsafe_masked_index_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addmm_decomposed_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_alias_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_aminmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_block_diag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_byte_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cfloat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagflat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_expand_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_reduce_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isposinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_item_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_2inputs_2outputs_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_inv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_ldl_factor_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_solve_triangular_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logical_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_amax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_softmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_std_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_sum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_pool2d_with_indices_backward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mvlgamma_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_elu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_embedding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_nearest-exact_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_logsigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool3d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_tanhshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_threshold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nonzero_static_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_inf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_pca_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_3_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_pow_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ravel_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_resolve_conj_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_round_decimals_3_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_searchsorted_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_short_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_general_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_slice_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_y0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_legendre_polynomial_p_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_std_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_stft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_svd_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_take_along_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_topk_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__efficient_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unique_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unsafe_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unsqueeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_where_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_zeros_like_cuda, test/test_ops.py::TestCommonCUDA::test_errors_arange_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_errors_dot_cuda, test/test_ops.py::TestCommonCUDA::test_errors_empty_permuted_cuda, test/test_ops.py::TestCommonCUDA::test_errors_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_errors_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_errors_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_errors_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_errors_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_errors_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_errors_jiterator_binary_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_errors_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_errors_lt_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_select_cuda, test/test_ops.py::TestCommonCUDA::test_errors_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_errors_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_conv3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_multi_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_rrelu_cuda, test/test_ops.py::TestCommonCUDA::test_errors_polar_cuda, test/test_ops.py::TestCommonCUDA::test_errors_pow_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout1_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_errors_uniform_cuda, test/test_ops.py::TestCommonCUDA::test_errors_view_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rpow___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_aminmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagflat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eye_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_full_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hash_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hash_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_2inputs_2outputs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mT_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_reflect_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_outer_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize_as__cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unravel_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___radd___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bool_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cdouble_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_chunk_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cummax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagonal_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_digamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_equal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_float_power_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_full_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isnan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isposinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isreal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_4inputs_with_extra_args_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_unary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_max_reduction_no_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_binary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_narrow_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_cosine_embedding_loss_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pad_circular_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_resolve_neg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_i1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_k0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_with_sizes_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_square_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_to_sparse_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unbind_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unique_consecutive_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unique_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___getitem___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmul___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rsub___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rsub___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_alias_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argwhere_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_inverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_corrcoef_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cov_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_unary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eig_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log10_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_no_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_maximum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_embedding_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_negative_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_static_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_interleave_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_hermite_polynomial_he_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tile_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_topk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unbind_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unravel_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_addbmm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_aminmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diff_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_item_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vecdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tensor_split_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tile_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_out___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___rxor___cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_vander_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nonzero_static_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_kron_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logcumsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sparse_sampled_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_squeeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_t_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tensordot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_y1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning___rpow___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___rsub___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_bool_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_cfloat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_double_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_int_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_abs_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atleast_3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_not_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_diag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_dot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isnan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isposinf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_item_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log1p_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_mul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_channel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_hardshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_threshold_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_signbit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_i0e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_transpose_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_transpose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unsqueeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__segment_reduce_lengths_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_amax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_any_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_asin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_count_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cumulative_trapezoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_hash_tensor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_histc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_2inputs_2outputs_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_cond_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_det_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigvalsh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_inv_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_rank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_qr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_solve_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_solve_triangular_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_tensorsolve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_binary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_pool2d_with_indices_backward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_channel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv_transpose1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_embedding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_feature_alpha_dropout_with_train_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_feature_alpha_dropout_without_train_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_gaussian_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_trilinear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_local_response_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_mse_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_replicate_negative_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pixel_unshuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_selu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_silu_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_triplet_margin_with_distance_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_fro_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_normal_in_place_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_permute_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_quantile_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_decimals_neg_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sinc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sinh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_i0e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_scaled_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_split_list_args_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_square_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_squeeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_var_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_var_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_view_as_real_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_out_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_gelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_gelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_huber_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_leaky_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mish_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pdist_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_prelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_ge_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_item_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_normal__in_place_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_sub_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_unbind_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_float8_e4m3fnuz, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_imag_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_imag_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_native_layer_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_huber_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softplus_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float8_e5m2, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lcm_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pdist_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_right_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float8_e4m3fn, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frac_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nextafter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_huber_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pdist_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_mean_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_aminmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_angle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_combinations_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gather_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_grad_oriented_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mH_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mT_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanmean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_negative_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_put_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_sparse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapezoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_expand_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_gather_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cond_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unbind_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsafe_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addmm_decomposed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_aminmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_eye_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_heaviside_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_int_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isneginf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_positive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_rad2deg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_resize_as__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_select_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_squeeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unbind_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unique_consecutive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zeros_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_shapes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_eq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_exp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_gather_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ge_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_gt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_igammac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isnan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_kron_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_narrow_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resize_as__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tensordot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_true_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unsafe_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_where_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_equal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_float_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_full_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_gt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_and_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_outer_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randint_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randn_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_j1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_topk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unbind_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_uniform_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_permuted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_full_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ge_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_geometric_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_movedim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rsub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_searchsorted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_t_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unbind_copy_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view_H_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___radd___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_T_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_bfloat16_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_chalf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_long_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addcdiv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atanh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_broadcast_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_column_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cumsum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diag_embed_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_dot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_lerp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logaddexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_ones_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_positive_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_pow_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_real_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_reshape_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sin_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_split_with_sizes_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tensor_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_view_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_vstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_broadcast_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_chalf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cholesky_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_column_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_conj_physical_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_empty_permuted_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_flipud_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_float_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isinf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_item_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_binary_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cond_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cross_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_det_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eigvals_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_inv_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_rank_hermitian_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logdet_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logical_xor_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lu_unpack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_normalize_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_channel_shuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv_transpose1d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_reflect_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_unfold_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nonzero_static_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_inf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ones_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ormqr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_pca_lowrank_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_permute_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_qr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_reciprocal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_reshape_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resize_as__cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sinh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_square_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sum_to_size_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_t_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_take_along_dim_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_take_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_to_sparse_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_triangular_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unfold_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_uniform_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unsafe_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_view_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_where_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_zeros_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_T_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_chalf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_int_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_addr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_all_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_any_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_cos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_ifft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isreal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_cross_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_vecdot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log10_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_ones_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_reshape_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sinc_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_transpose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unfold_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unsqueeze_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_acos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addbmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_alias_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_any_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_3d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_baddbmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_block_diag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cfloat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_chalf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_conj_physical_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cov_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diff_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_double_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_equal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_exp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_hfft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifftshift_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_hstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_imag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_istft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_unary_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cholesky_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cond_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_eigvalsh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_multi_dot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_pinv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_qr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_prod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ne_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_neg_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_full_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_normalize_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_constant_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_replicate_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_inf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_outer_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_randn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_resize__cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_resize_as__cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_resolve_neg_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rot90_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sigmoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_split_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_t_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_trapz_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unsqueeze_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_unbiased_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_vdot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_vstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rsub___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__chunk_cat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_T_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_half_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_int_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_as_strided_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_bucketize_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clone_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diagonal_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_erfc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expand_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_eye_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_irfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_floor_divide_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_i0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_svd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logaddexp2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_masked_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_empty_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_glu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_pixel_shuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softplus_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_reciprocal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_round_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_bessel_j1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_i0e_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_ndtr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_square_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_squeeze_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_std_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_trace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unfold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_var_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_vdot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_vsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__segment_reduce_lengths_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_aminmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_argmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_asinh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_max_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_constant_pad_nd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_corrcoef_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cumprod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diff_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_einsum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_empty_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_empty_permuted_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fftshift_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_full_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_gradient_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_heaviside_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lgamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_cond_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_cross_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_inv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_ldl_factor_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_power_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_vecdot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_not_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_or_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_xor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_cumprod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_normalize_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_softmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_std_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_var_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_multinomial_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_narrow_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_native_dropout_backward_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_celu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_channel_shuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_elu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_linear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softplus_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_outer_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_permute_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_pow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rad2deg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ravel_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resolve_neg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_select_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sgn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sigmoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sinh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_slice_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_slice_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sparse_sampled_addmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_airy_ai_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_std_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_t_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_take_along_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tensordot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_to_sparse_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_topk_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tril_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unsafe_chunk_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_var_mean_unbiased_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_where_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_xlogy_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_any_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argwhere_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_H_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__unsafe_masked_index_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bincount_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_or_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_deg2rad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_expand_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_eye_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ifftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_geqrf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_le_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logical_and_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resize__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scalar_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_uniform_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_grid_sampler_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_repeat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_squeeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_transpose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rmod___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_alias_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cfloat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_digamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_svdvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_matrix_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nanmean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_squeeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_deg2rad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_equal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_floor_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_geometric_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_gradient_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_half_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_igamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lcm_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_tensorinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_and_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_or_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_narrow_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_gelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rand_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randint_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_short_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signbit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_y1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tril_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_unbind_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_as_real_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_allclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_aminmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_any_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_argwhere_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_broadcast_shapes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cauchy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_chalf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_permuted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_full_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_grid_sampler_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logaddexp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_and_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_narrow_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ones_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resize_as__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_cosine_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_j1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_zeta_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_as_real_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_complex32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float64, test/test_ops.py::TestTagsCUDA::test_tags___rmatmul___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___rmod___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___rmul___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_long_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_short_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_addr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_amax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_arange_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ceil_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_contiguous_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_digamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_erf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_exponential_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_fftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_rfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_rfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fliplr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_flipud_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fmod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_gt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_imag_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags__refs_isinf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_le_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logaddexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_lt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_minimum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_celu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_renorm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_round_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_i0e_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_trace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_view_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_vsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__unsafe_masked_index_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_acosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addbmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addcmul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addmm_decomposed_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addmv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_aminmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_any_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_partial_views_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_asin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bernoulli_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_broadcast_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_chalf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_char_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_clone_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_complex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_copysign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diagonal_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_digamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_empty_permuted_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_erf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expm1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ihfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_rfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_floor_divide_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_full_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_grid_sampler_2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_gt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_hsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_i0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_put_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_int_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cholesky_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_power_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_slogdet_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logdet_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logspace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lu_unpack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_select_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_matrix_exp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nanquantile_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_narrow_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_zeros_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nextafter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_linear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_normalize_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softplus_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_norm_inf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_norm_nuc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ormqr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_quantile_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resize__cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resolve_conj_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_searchsorted_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sgn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_softmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_j1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_y1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_split_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_mean_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sum_to_size_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_take_along_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_take_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_trapz_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tril_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tril_indices_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_triu_indices_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_view_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_xlogy_cuda_float32, test/test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_add_cuda_float32, test/test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_div_trunc_rounding_cuda_float32, test/test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_mul_cuda_float32 2025-12-04T10:59:21.1588906Z 2025-12-04T10:59:21.1589052Z Finished test_ops 6/7 ... [2025-12-04 10:59:20.954034][215652.213413596], took 8.75min 2025-12-04T10:59:21.1589670Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_decompose_mem_bound_mm/inductor.test_decompose_mem_bound_mm-e91b2bf8a5c2cbfd.xml 2025-12-04T10:59:21.1590269Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:59:21.1590570Z Running test_nn 1/1 ... [2025-12-04 10:59:21.093433][215652.35280845] 2025-12-04T10:59:21.1590813Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:59:21.1591363Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_nn.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:59:21.093733] 2025-12-04T11:01:57.9944930Z 2025-12-04T11:01:57.9948610Z test_nn 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_nn_1.1_4f0e5ece902956b0_.log 2025-12-04T11:01:58.0832597Z Running 2298 items in this shard: test/test_nn.py::TestNN::test_AdaptiveLogSoftmax, test/test_nn.py::TestNN::test_AdaptiveLogSoftmax_cuda, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_reduce, test/test_nn.py::TestNN::test_BCELoss_no_reduce_cuda, test/test_nn.py::TestNN::test_BCELoss_no_reduce_scalar, test/test_nn.py::TestNN::test_BCELoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_scalar, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_legacy_enum, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_legacy_enum_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_scalar, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_CELU_no_batch_dim, test/test_nn.py::TestNN::test_CELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_CTCLoss_critical_target_len, test/test_nn.py::TestNN::test_CTCLoss_lengthchecks_cpu, test/test_nn.py::TestNN::test_CTCLoss_lengthchecks_cuda, test/test_nn.py::TestNN::test_CTCLoss_long_targets, test/test_nn.py::TestNN::test_CTCLoss_typechecks, test/test_nn.py::TestNN::test_CTCLoss_zero_infinity, test/test_nn.py::TestNN::test_CTCLoss_zero_lengths, test/test_nn.py::TestNN::test_Conv1d, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_cuda, test/test_nn.py::TestNN::test_Conv1d_dilated, test/test_nn.py::TestNN::test_Conv1d_dilated_cuda, test/test_nn.py::TestNN::test_Conv1d_groups, test/test_nn.py::TestNN::test_Conv1d_groups_cuda, test/test_nn.py::TestNN::test_Conv1d_pad1, test/test_nn.py::TestNN::test_Conv1d_pad1_cuda, test/test_nn.py::TestNN::test_Conv1d_pad1size1, test/test_nn.py::TestNN::test_Conv1d_pad1size1_cuda, test/test_nn.py::TestNN::test_Conv1d_pad2, test/test_nn.py::TestNN::test_Conv1d_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_pad2size1, test/test_nn.py::TestNN::test_Conv1d_pad2size1_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_same, test/test_nn.py::TestNN::test_Conv1d_pad_same2, test/test_nn.py::TestNN::test_Conv1d_pad_same2_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_same_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv1d_pad_same_dilated_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_valid, test/test_nn.py::TestNN::test_Conv1d_pad_valid_cuda, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_stride, test/test_nn.py::TestNN::test_Conv1d_stride_cuda, test/test_nn.py::TestNN::test_Conv1d_zero_batch, test/test_nn.py::TestNN::test_Conv1d_zero_batch_cuda, test/test_nn.py::TestNN::test_Conv1d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_zeros_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise, test/test_nn.py::TestNN::test_Conv2d_depthwise_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_dilated, test/test_nn.py::TestNN::test_Conv2d_depthwise_dilated_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_padded, test/test_nn.py::TestNN::test_Conv2d_depthwise_padded_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_strided, test/test_nn.py::TestNN::test_Conv2d_depthwise_strided_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier_cuda, test/test_nn.py::TestNN::test_Conv2d_dilated, test/test_nn.py::TestNN::test_Conv2d_dilated_cuda, test/test_nn.py::TestNN::test_Conv2d_dilated_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_dilated_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_groups, test/test_nn.py::TestNN::test_Conv2d_groups_cuda, test/test_nn.py::TestNN::test_Conv2d_groups_thnn, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_cuda, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_groups_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_groups_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_no_bias, test/test_nn.py::TestNN::test_Conv2d_no_bias_cuda, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_pad_same, test/test_nn.py::TestNN::test_Conv2d_pad_same_cuda, test/test_nn.py::TestNN::test_Conv2d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv2d_pad_same_dilated_cuda, test/test_nn.py::TestNN::test_Conv2d_pad_valid, test/test_nn.py::TestNN::test_Conv2d_pad_valid_cuda, test/test_nn.py::TestNN::test_Conv2d_padding, test/test_nn.py::TestNN::test_Conv2d_padding_cuda, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_reflect_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_reflect_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d_strided, test/test_nn.py::TestNN::test_Conv2d_strided_cuda, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_zero_batch, test/test_nn.py::TestNN::test_Conv2d_zero_batch_cuda, test/test_nn.py::TestNN::test_Conv2d_zero_batch_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_zero_batch_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv3d, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_cuda, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_circular_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv3d_cuda, test/test_nn.py::TestNN::test_Conv3d_dilated, test/test_nn.py::TestNN::test_Conv3d_dilated_cuda, test/test_nn.py::TestNN::test_Conv3d_dilated_strided, test/test_nn.py::TestNN::test_Conv3d_dilated_strided_cuda, test/test_nn.py::TestNN::test_Conv3d_groups, test/test_nn.py::TestNN::test_Conv3d_groups_cuda, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_no_bias, test/test_nn.py::TestNN::test_Conv3d_no_bias_cuda, test/test_nn.py::TestNN::test_Conv3d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_pad_same, test/test_nn.py::TestNN::test_Conv3d_pad_same_cuda, test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated_cuda, test/test_nn.py::TestNN::test_Conv3d_pad_valid, test/test_nn.py::TestNN::test_Conv3d_pad_valid_cuda, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv3d_stride, test/test_nn.py::TestNN::test_Conv3d_stride_cuda, test/test_nn.py::TestNN::test_Conv3d_stride_padding, test/test_nn.py::TestNN::test_Conv3d_stride_padding_cuda, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_zero_batch, test/test_nn.py::TestNN::test_Conv3d_zero_batch_cuda, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d, test/test_nn.py::TestNN::test_ConvTranspose1d_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d_dilated, test/test_nn.py::TestNN::test_ConvTranspose1d_dilated_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d_groups, test/test_nn.py::TestNN::test_ConvTranspose1d_groups_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d_no_bias, test/test_nn.py::TestNN::test_ConvTranspose1d_no_bias_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d, test/test_nn.py::TestNN::test_ConvTranspose2d_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_groups, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose3d, test/test_nn.py::TestNN::test_ConvTranspose3d_cuda, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated_cuda, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_CrossMapLRN2d, test/test_nn.py::TestNN::test_CrossMapLRN2d_cuda, test/test_nn.py::TestNN::test_ELU_no_batch_dim, test/test_nn.py::TestNN::test_ELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Embedding, test/test_nn.py::TestNN::test_EmbeddingBag_discontiguous, test/test_nn.py::TestNN::test_EmbeddingBag_discontiguous_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_max, test/test_nn.py::TestNN::test_EmbeddingBag_max_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_max_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_max_padding_idx_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_mean, test/test_nn.py::TestNN::test_EmbeddingBag_mean_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_mean_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_mean_padding_idx_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sparse, test/test_nn.py::TestNN::test_EmbeddingBag_sparse_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sum, test/test_nn.py::TestNN::test_EmbeddingBag_sum_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sum_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_sum_padding_idx_cuda, test/test_nn.py::TestNN::test_Embedding_cuda, test/test_nn.py::TestNN::test_Embedding_discontiguous, test/test_nn.py::TestNN::test_Embedding_discontiguous_cuda, test/test_nn.py::TestNN::test_Embedding_sparse, test/test_nn.py::TestNN::test_Embedding_sparse_cuda, test/test_nn.py::TestNN::test_Flatten, test/test_nn.py::TestNN::test_Flatten_cuda, test/test_nn.py::TestNN::test_Flatten_no_batch_dim, test/test_nn.py::TestNN::test_Flatten_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Fold, test/test_nn.py::TestNN::test_Fold_cuda, test/test_nn.py::TestNN::test_Fold_int_input, test/test_nn.py::TestNN::test_Fold_int_input_cuda, test/test_nn.py::TestNN::test_Fold_no_batch_dim_input, test/test_nn.py::TestNN::test_Fold_no_batch_dim_input_cuda, test/test_nn.py::TestNN::test_Fold_no_batch_dim_int_input, test/test_nn.py::TestNN::test_Fold_no_batch_dim_int_input_cuda, test/test_nn.py::TestNN::test_GELU_no_batch_dim, test/test_nn.py::TestNN::test_GELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_GLU_no_batch_dim, test/test_nn.py::TestNN::test_GLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardshrink_no_batch_dim, test/test_nn.py::TestNN::test_Hardshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardsigmoid_no_batch_dim, test/test_nn.py::TestNN::test_Hardsigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardswish_no_batch_dim, test/test_nn.py::TestNN::test_Hardswish_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardtanh_no_batch_dim, test/test_nn.py::TestNN::test_Hardtanh_no_batch_dim_cuda, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_margin_no_reduce, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_margin_no_reduce_cuda, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_reduce, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_HuberLoss_delta, test/test_nn.py::TestNN::test_HuberLoss_delta_cuda, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_batch_mean, test/test_nn.py::TestNN::test_KLDivLoss_batch_mean_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_log_target_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_log_target_cuda, test/test_nn.py::TestNN::test_KLDivLoss_with_log_target_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_with_log_target_no_reduce_cuda, test/test_nn.py::TestNN::test_KLDivLoss_with_target_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_with_target_no_reduce_cuda, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_reduce, test/test_nn.py::TestNN::test_L1Loss_no_reduce_complex, test/test_nn.py::TestNN::test_L1Loss_no_reduce_complex_cuda, test/test_nn.py::TestNN::test_L1Loss_no_reduce_cuda, test/test_nn.py::TestNN::test_L1Loss_no_reduce_scalar, test/test_nn.py::TestNN::test_L1Loss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_LSTM_cell, test/test_nn.py::TestNN::test_LSTM_cell_forward_hidden_size, test/test_nn.py::TestNN::test_LSTM_cell_forward_input_size, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_cuda, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_eval, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_eval_cuda, test/test_nn.py::TestNN::test_LeakyReLU_no_batch_dim, test/test_nn.py::TestNN::test_LeakyReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Linear, test/test_nn.py::TestNN::test_Linear_cuda, test/test_nn.py::TestNN::test_Linear_no_batch_dim, test/test_nn.py::TestNN::test_Linear_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Linear_no_bias, test/test_nn.py::TestNN::test_Linear_no_bias_cuda, test/test_nn.py::TestNN::test_LogSigmoid_no_batch_dim, test/test_nn.py::TestNN::test_LogSigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_reduce, test/test_nn.py::TestNN::test_MSELoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MSELoss_no_reduce_scalar, test/test_nn.py::TestNN::test_MSELoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MaxUnpool1d_net, test/test_nn.py::TestNN::test_MaxUnpool1d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool1d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool1d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MaxUnpool2d_net, test/test_nn.py::TestNN::test_MaxUnpool2d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool2d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool2d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MaxUnpool3d_net, test/test_nn.py::TestNN::test_MaxUnpool3d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool3d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool3d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Mish_no_batch_dim, test/test_nn.py::TestNN::test_Mish_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ModuleDict, test/test_nn.py::TestNN::test_ModuleList, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_0d_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_0d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_1d_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_1d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_index_neg, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_index_neg_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_weights_no_reduce, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_1d_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_1d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_margin_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_margin_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_p_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_p_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_weights_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_reduce, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_neg, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_neg_cuda, test/test_nn.py::TestNN::test_PReLU_backward_requires_grad_false, test/test_nn.py::TestNN::test_PReLU_no_batch_dim, test/test_nn.py::TestNN::test_PReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_PairwiseDistance, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_lhs, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_lhs_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_rhs, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_rhs_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_no_batch_dim, test/test_nn.py::TestNN::test_PairwiseDistance_no_batch_dim_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_with_non_default_args, test/test_nn.py::TestNN::test_PairwiseDistance_with_non_default_args_cuda, test/test_nn.py::TestNN::test_ParameterDict, test/test_nn.py::TestNN::test_ParameterDict_replication, test/test_nn.py::TestNN::test_ParameterList, test/test_nn.py::TestNN::test_ParameterList_meta, test/test_nn.py::TestNN::test_ParameterList_replication, test/test_nn.py::TestNN::test_PixelShuffle, test/test_nn.py::TestNN::test_PixelShuffle_cuda, test/test_nn.py::TestNN::test_PixelUnshuffle, test/test_nn.py::TestNN::test_PixelUnshuffle_cuda, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_reduce, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_RNN_cell, test/test_nn.py::TestNN::test_RNN_cell_forward_zero_hidden_size, test/test_nn.py::TestNN::test_RNN_cell_no_broadcasting, test/test_nn.py::TestNN::test_RNN_change_dropout, test/test_nn.py::TestNN::test_RNN_cpu_vs_cudnn_no_dropout, test/test_nn.py::TestNN::test_RNN_cpu_vs_cudnn_with_dropout, test/test_nn.py::TestNN::test_RNN_cudnn_weight_norm, test/test_nn.py::TestNN::test_RNN_dropout, test/test_nn.py::TestNN::test_RNN_dropout_state, test/test_nn.py::TestNN::test_RNN_input_size_zero, test/test_nn.py::TestNN::test_RNN_nonlinearity, test/test_nn.py::TestNN::test_RNN_nonlinearity_passed_as_arg, test/test_nn.py::TestNN::test_RReLU, test/test_nn.py::TestNN::test_RReLU_cuda, test/test_nn.py::TestNN::test_RReLU_no_batch_dim, test/test_nn.py::TestNN::test_RReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_RReLU_with_up_down, test/test_nn.py::TestNN::test_RReLU_with_up_down_cuda, test/test_nn.py::TestNN::test_RReLU_with_up_down_scalar, test/test_nn.py::TestNN::test_RReLU_with_up_down_scalar_cuda, test/test_nn.py::TestNN::test_ReLU6_no_batch_dim, test/test_nn.py::TestNN::test_ReLU6_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ReLU_no_batch_dim, test/test_nn.py::TestNN::test_ReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d, test/test_nn.py::TestNN::test_ReplicationPad3d_complex, test/test_nn.py::TestNN::test_ReplicationPad3d_complex_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d_no_batch_dim, test/test_nn.py::TestNN::test_ReplicationPad3d_no_batch_dim_cuda, test/test_nn.py::TestNN::test_SELU_no_batch_dim, test/test_nn.py::TestNN::test_SELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Sequential_add, test/test_nn.py::TestNN::test_Sequential_append, test/test_nn.py::TestNN::test_Sequential_delitem, test/test_nn.py::TestNN::test_Sequential_extend, test/test_nn.py::TestNN::test_Sequential_getitem, test/test_nn.py::TestNN::test_Sequential_iadd, test/test_nn.py::TestNN::test_Sequential_imul, test/test_nn.py::TestNN::test_Sequential_insert, test/test_nn.py::TestNN::test_Sequential_insert_fail_case, test/test_nn.py::TestNN::test_Sequential_mul, test/test_nn.py::TestNN::test_Sequential_pop, test/test_nn.py::TestNN::test_Sequential_rmul, test/test_nn.py::TestNN::test_Sequential_setitem, test/test_nn.py::TestNN::test_Sequential_setitem_named, test/test_nn.py::TestNN::test_SiLU_no_batch_dim, test/test_nn.py::TestNN::test_SiLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Sigmoid_no_batch_dim, test/test_nn.py::TestNN::test_Sigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_beta, test/test_nn.py::TestNN::test_SmoothL1Loss_beta_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_scalar, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_zero_beta, test/test_nn.py::TestNN::test_SmoothL1Loss_zero_beta_cuda, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_reduce, test/test_nn.py::TestNN::test_SoftMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_Softplus_no_batch_dim, test/test_nn.py::TestNN::test_Softplus_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Softshrink_no_batch_dim, test/test_nn.py::TestNN::test_Softshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Softsign_no_batch_dim, test/test_nn.py::TestNN::test_Softsign_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Tanh_no_batch_dim, test/test_nn.py::TestNN::test_Tanh_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Tanhshrink_no_batch_dim, test/test_nn.py::TestNN::test_Tanhshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Threshold_no_batch_dim, test/test_nn.py::TestNN::test_Threshold_no_batch_dim_cuda, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation_cuda, test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation, test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation_cuda, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation_cuda, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation_cuda, test/test_nn.py::TestNN::test_Transformer_cell, test/test_nn.py::TestNN::test_Transformer_multilayer_coder, test/test_nn.py::TestNN::test_Transformer_multilayer_coder_cuda, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_Unflatten_no_batch_dim, test/test_nn.py::TestNN::test_Unflatten_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Unfold, test/test_nn.py::TestNN::test_Unfold_cuda, test/test_nn.py::TestNN::test_Unfold_int_input, test/test_nn.py::TestNN::test_Unfold_int_input_cuda, test/test_nn.py::TestNN::test_adaptive_log_softmax, test/test_nn.py::TestNN::test_add_module, test/test_nn.py::TestNN::test_add_module_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_affine_grid, test/test_nn.py::TestNN::test_affine_grid_3d, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cpu_nd_2, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cpu_nd_3, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cuda_nd_2, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cuda_nd_3, test/test_nn.py::TestNN::test_affine_grid_error_checking, test/test_nn.py::TestNN::test_assignment, test/test_nn.py::TestNN::test_batch_norm_update_stats, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NCHW_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NCHW_vs_cpu_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NCHW_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NCHW_vs_native_float32, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NCHW_vs_native_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NCHW_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_NCHW_float32, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_NCHW_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_NCHW_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_cpu_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_native_float32, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_native_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NCHW_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_2D_train_NCHW_vs_cpu_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NCHW_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NCHW_vs_native_float32, test/test_nn.py::TestNN::test_batchnorm_2D_train_NCHW_vs_native_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NCHW_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_NCHW_float32, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_NCHW_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_NCHW_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_cpu_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_native_float32, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_native_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NCHW_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NCHW_vs_cpu_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NCHW_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NCHW_vs_native_float32, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NCHW_vs_native_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NCHW_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_NCHW_float32, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_NCHW_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_NCHW_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_cpu_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_native_float32, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_native_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NCHW_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_3D_train_NCHW_vs_cpu_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NCHW_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NCHW_vs_native_float32, test/test_nn.py::TestNN::test_batchnorm_3D_train_NCHW_vs_native_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NCHW_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_NCHW_float32, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_NCHW_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_NCHW_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_cpu_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_native_float32, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_native_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_buffer_update_when_stats_are_not_tracked, test/test_nn.py::TestNN::test_batchnorm_cudnn_half, test/test_nn.py::TestNN::test_batchnorm_cudnn_nhwc, test/test_nn.py::TestNN::test_batchnorm_half_overflow, test/test_nn.py::TestNN::test_batchnorm_load_state_dict, test/test_nn.py::TestNN::test_batchnorm_nhwc_cpu, test/test_nn.py::TestNN::test_batchnorm_nhwc_cuda, test/test_nn.py::TestNN::test_batchnorm_non_contig_cpu_BatchNorm2d, test/test_nn.py::TestNN::test_batchnorm_non_contig_cpu_SyncBatchNorm, test/test_nn.py::TestNN::test_batchnorm_nonaffine_cuda_half_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_bias_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_less_than_one_value_per_channel, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_mean_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_var_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_var_or_running_mean_have_forward_grad, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_weight_is_not_same_size_as_input, test/test_nn.py::TestNN::test_bce_loss_always_nonnegative, test/test_nn.py::TestNN::test_bce_loss_broadcasts_weights, test/test_nn.py::TestNN::test_bce_loss_input_range, test/test_nn.py::TestNN::test_bce_loss_size_mismatch, test/test_nn.py::TestNN::test_bce_with_logits_broadcasts_pos_weights, test/test_nn.py::TestNN::test_bce_with_logits_broadcasts_weights, test/test_nn.py::TestNN::test_bce_with_logits_gives_same_result_as_sigmoid_and_bce_loss, test/test_nn.py::TestNN::test_bce_with_logits_gives_same_result_as_sigmoid_and_bce_loss_large_tensors_with_grad, test/test_nn.py::TestNN::test_bce_with_logits_has_correct_forward_grad, test/test_nn.py::TestNN::test_bce_with_logits_has_correct_grad_at_zero, test/test_nn.py::TestNN::test_bce_with_logits_ones_in_pos_weights_are_the_same_as_none, test/test_nn.py::TestNN::test_bce_with_logits_raises_if_target_and_input_are_different_size, test/test_nn.py::TestNN::test_bce_with_logits_stability, test/test_nn.py::TestNN::test_bce_with_logits_with_pos_weight_has_correct_grad_at_zero, test/test_nn.py::TestNN::test_bilinear, test/test_nn.py::TestNN::test_bilinear_broadcasting, test/test_nn.py::TestNN::test_bilinear_no_bias, test/test_nn.py::TestNN::test_bilinear_non_contiguous, test/test_nn.py::TestNN::test_bilinear_value_error, test/test_nn.py::TestNN::test_broadcast_double_backwards_gpu, test/test_nn.py::TestNN::test_broadcast_no_grad, test/test_nn.py::TestNN::test_broadcast_not_requiring_grad, test/test_nn.py::TestNN::test_buffer_bad_module_subclass, test/test_nn.py::TestNN::test_buffer_not_persistent, test/test_nn.py::TestNN::test_buffer_not_persistent_assign, test/test_nn.py::TestNN::test_buffer_not_persistent_del, test/test_nn.py::TestNN::test_buffer_not_persistent_load, test/test_nn.py::TestNN::test_buffer_not_persistent_overwrite, test/test_nn.py::TestNN::test_buffers_and_named_buffers, test/test_nn.py::TestNN::test_call_supports_python_dict_output, test/test_nn.py::TestNN::test_channel_shuffle_input_checks, test/test_nn.py::TestNN::test_channel_shuffle_return_alias_of_self, test/test_nn.py::TestNN::test_children, test/test_nn.py::TestNN::test_container_copy, test/test_nn.py::TestNN::test_convert_sync_batchnorm, test/test_nn.py::TestNN::test_cosine_embedding_loss_error_on_diff_shapes, test/test_nn.py::TestNN::test_cosine_embedding_loss_error_on_nonexpandable_shapes, test/test_nn.py::TestNN::test_cosine_embedding_loss_invalid_shape, test/test_nn.py::TestNN::test_cosine_embedding_loss_margin_no_reduce, test/test_nn.py::TestNN::test_cosine_embedding_loss_no_reduce, test/test_nn.py::TestNN::test_cosine_embedding_loss_with_diff_type, test/test_nn.py::TestNN::test_cosine_similarity, test/test_nn.py::TestNN::test_cross_entropy_loss, test/test_nn.py::TestNN::test_cross_entropy_loss_precision, test/test_nn.py::TestNN::test_cross_entropy_loss_zero_div, test/test_nn.py::TestNN::test_cudnn_forward_exception, test/test_nn.py::TestNN::test_cudnn_rnn_dropout_states_device, test/test_nn.py::TestNN::test_cudnn_weight_format, test/test_nn.py::TestNN::test_cudnn_weight_tying, test/test_nn.py::TestNN::test_dir, test/test_nn.py::TestNN::test_dir_digit, test/test_nn.py::TestNN::test_elu_inplace_gradgrad, test/test_nn.py::TestNN::test_elu_inplace_on_view, test/test_nn.py::TestNN::test_error_RNN_seq_len_zero, test/test_nn.py::TestNN::test_extra_state, test/test_nn.py::TestNN::test_extra_state_missing_get_extra_state, test/test_nn.py::TestNN::test_extra_state_missing_set_extra_state, test/test_nn.py::TestNN::test_extra_state_non_dict, test/test_nn.py::TestNN::test_fb_fc_packed, test/test_nn.py::TestNN::test_flatten, test/test_nn.py::TestNN::test_fold_invalid_arg, test/test_nn.py::TestNN::test_fractional_max_pool2d_invalid_output_ratio, test/test_nn.py::TestNN::test_gaussian_nll_loss_args, test/test_nn.py::TestNN::test_gaussian_nll_loss_broadcasting, test/test_nn.py::TestNN::test_gaussian_nll_loss_scalar_var, test/test_nn.py::TestNN::test_get_buffer, test/test_nn.py::TestNN::test_get_buffer_from_submodules, test/test_nn.py::TestNN::test_getattr_with_property, test/test_nn.py::TestNN::test_grid_sample, test/test_nn.py::TestNN::test_grid_sample_3d, test/test_nn.py::TestNN::test_grid_sample_error_checking, test/test_nn.py::TestNN::test_grid_sample_nearest_neighbor_rounding_mode_consistency, test/test_nn.py::TestNN::test_hardtanh_backward, test/test_nn.py::TestNN::test_hardtanh_inplace_gradgrad, test/test_nn.py::TestNN::test_huber_loss_invalid_delta, test/test_nn.py::TestNN::test_inplace_thnn, test/test_nn.py::TestNN::test_interpolate, test/test_nn.py::TestNN::test_interpolate_bicubic_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_shared_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_shared_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_shared_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_shared_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_buffer_overflow, test/test_nn.py::TestNN::test_interpolate_illegal_memory_access, test/test_nn.py::TestNN::test_interpolate_linear_1d, test/test_nn.py::TestNN::test_interpolate_linear_1d_align_corners, test/test_nn.py::TestNN::test_interpolate_linear_1d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_linear_1d_cuda, test/test_nn.py::TestNN::test_interpolate_linear_1d_zero_dim, test/test_nn.py::TestNN::test_interpolate_linear_1d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_align_corners, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_cuda, test/test_nn.py::TestNN::test_interpolate_linear_tuple_1d, test/test_nn.py::TestNN::test_interpolate_linear_tuple_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_1d, test/test_nn.py::TestNN::test_interpolate_nearest_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_1d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_1d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d, test/test_nn.py::TestNN::test_interpolate_nearest_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d_launch_configs, test/test_nn.py::TestNN::test_interpolate_nearest_2d_launch_configs_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_3d, test/test_nn.py::TestNN::test_interpolate_nearest_3d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_3d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_3d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_1d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_2d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_3d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_3d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_1d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_2d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_3d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_zero_dim, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_align_corners, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_align_corners, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_cuda, test/test_nn.py::TestNN::test_interpolate_undefined_behavior_casting, test/test_nn.py::TestNN::test_kl_div_log_softmax_target, test/test_nn.py::TestNN::test_kl_div_with_diff_type, test/test_nn.py::TestNN::test_kl_div_with_diff_type_log_target, test/test_nn.py::TestNN::test_l1_loss_correct, test/test_nn.py::TestNN::test_large_max_pool2d_ch_last, test/test_nn.py::TestNN::test_layer_norm_backwards_eps, test/test_nn.py::TestNN::test_layer_norm_eps, test/test_nn.py::TestNN::test_layer_norm_grads_with_create_graph_flag, test/test_nn.py::TestNN::test_layer_norm_large_tensor, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightStrided, test/test_nn.py::TestNN::test_linear_broadcasting, test/test_nn.py::TestNN::test_linear_raise_on_scalar_input, test/test_nn.py::TestNN::test_log_softmax_dim0, test/test_nn.py::TestNN::test_log_softmax_dim0_cuda, test/test_nn.py::TestNN::test_log_softmax_dim3, test/test_nn.py::TestNN::test_log_softmax_dim3_cuda, test/test_nn.py::TestNN::test_log_softmax_lastdim, test/test_nn.py::TestNN::test_log_softmax_lastdim_cuda, test/test_nn.py::TestNN::test_log_softmax_scalar, test/test_nn.py::TestNN::test_log_softmax_scalar_cuda, test/test_nn.py::TestNN::test_log_softmax_spatial, test/test_nn.py::TestNN::test_log_softmax_spatial_cuda, test/test_nn.py::TestNN::test_log_softmax_spatial_special, test/test_nn.py::TestNN::test_log_softmax_spatial_special_cuda, test/test_nn.py::TestNN::test_loss_equal_input_target_shape, test/test_nn.py::TestNN::test_margin_ranking_loss_margin_no_reduce, test/test_nn.py::TestNN::test_margin_ranking_loss_no_reduce, test/test_nn.py::TestNN::test_max_pool1d_invalid_output_size, test/test_nn.py::TestNN::test_module_apply_inplace_op, test/test_nn.py::TestNN::test_module_backcompat, test/test_nn.py::TestNN::test_module_super_init, test/test_nn.py::TestNN::test_module_to_argparse, test/test_nn.py::TestNN::test_modules, test/test_nn.py::TestNN::test_mse_loss_size_warning, test/test_nn.py::TestNN::test_multimarginloss_1d_input_0d_target_no_reduce, test/test_nn.py::TestNN::test_multimarginloss_1d_input_0d_target_no_reduce_cuda, test/test_nn.py::TestNN::test_named_children, test/test_nn.py::TestNN::test_named_modules, test/test_nn.py::TestNN::test_named_parameters_remove_duplicate, test/test_nn.py::TestNN::test_native_channel_shuffle_return_alias_of_self, test/test_nn.py::TestNN::test_nested_tensor_from_mask, test/test_nn.py::TestNN::test_nested_tensor_from_mask_error, test/test_nn.py::TestNN::test_no_grad, test/test_nn.py::TestNN::test_non_leaf_parameters, test/test_nn.py::TestNN::test_normalize, test/test_nn.py::TestNN::test_overwrite_module_params_on_conversion, test/test_nn.py::TestNN::test_pack_sequence_batch_sizes_throw, test/test_nn.py::TestNN::test_pad_scalar_error, test/test_nn.py::TestNN::test_padding_list, test/test_nn.py::TestNN::test_pairwise_distance, test/test_nn.py::TestNN::test_parameter_assignment, test/test_nn.py::TestNN::test_parameterlistdict_pickle, test/test_nn.py::TestNN::test_parameterlistdict_setting_attributes, test/test_nn.py::TestNN::test_parameters_and_named_parameters, test/test_nn.py::TestNN::test_parameters_to_vector, test/test_nn.py::TestNN::test_parse_to, test/test_nn.py::TestNN::test_partial_flat_weights, test/test_nn.py::TestNN::test_pdist, test/test_nn.py::TestNN::test_pdist_cpu_gradgrad_unimplemented, test/test_nn.py::TestNN::test_pdist_cuda_gradgrad_unimplemented, test/test_nn.py::TestNN::test_pdist_empty_col, test/test_nn.py::TestNN::test_pdist_empty_row, test/test_nn.py::TestNN::test_pdist_large, test/test_nn.py::TestNN::test_pdist_zeros, test/test_nn.py::TestNN::test_pickle_module_no_weights_only_warning, test/test_nn.py::TestNN::test_pixel_shuffle_nhwc_cpu, test/test_nn.py::TestNN::test_pixel_shuffle_unshuffle, test/test_nn.py::TestNN::test_pointwise_loss_broadcast, test/test_nn.py::TestNN::test_pointwise_loss_target_grad_none_reduction, test/test_nn.py::TestNN::test_projections_errors_on_gru_and_rnn, test/test_nn.py::TestNN::test_projections_lstm_args_check, test/test_nn.py::TestNN::test_projections_lstm_check_device, test/test_nn.py::TestNN::test_projections_lstm_initial_hidden_state, test/test_nn.py::TestNN::test_register_buffer_allows_overwriting_with_same_name, test/test_nn.py::TestNN::test_register_buffer_allows_tensor_like_object, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_name_is_not_string, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_not_tensor, test/test_nn.py::TestNN::test_register_parameter_allows_overwriting_with_same_name, test/test_nn.py::TestNN::test_register_parameter_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_register_parameter_raises_error_if_name_is_not_string, test/test_nn.py::TestNN::test_relu_inplace_on_view, test/test_nn.py::TestNN::test_repr, test/test_nn.py::TestNN::test_requires_grad_, test/test_nn.py::TestNN::test_rnn_args_check, test/test_nn.py::TestNN::test_rnn_check_device, test/test_nn.py::TestNN::test_rnn_initial_hidden_state, test/test_nn.py::TestNN::test_rnn_weight_norm, test/test_nn.py::TestNN::test_set_submodule, test/test_nn.py::TestNN::test_share_memory, test/test_nn.py::TestNN::test_smoothl1loss_intergral_target, test/test_nn.py::TestNN::test_smoothl1loss_negative_beta_not_supported, test/test_nn.py::TestNN::test_softmax_functional_dim0, test/test_nn.py::TestNN::test_softmax_functional_dim0_cuda, test/test_nn.py::TestNN::test_softmax_functional_dim3, test/test_nn.py::TestNN::test_softmax_functional_dim3_cuda, test/test_nn.py::TestNN::test_softmax_functional_scalar, test/test_nn.py::TestNN::test_softmax_functional_scalar_cuda, test/test_nn.py::TestNN::test_softmax_lastdim, test/test_nn.py::TestNN::test_softmax_lastdim_cuda, test/test_nn.py::TestNN::test_softmax_lastdim_dtype, test/test_nn.py::TestNN::test_softmax_lastdim_dtype_cuda, test/test_nn.py::TestNN::test_softmax_spatial, test/test_nn.py::TestNN::test_softmax_spatial_cuda, test/test_nn.py::TestNN::test_softmax_spatial_dtype, test/test_nn.py::TestNN::test_softmax_spatial_dtype_cuda, test/test_nn.py::TestNN::test_softmax_spatial_special, test/test_nn.py::TestNN::test_softmax_spatial_special_cuda, test/test_nn.py::TestNN::test_softmin, test/test_nn.py::TestNN::test_spectral_norm, test/test_nn.py::TestNN::test_spectral_norm_dim, test/test_nn.py::TestNN::test_spectral_norm_forward, test/test_nn.py::TestNN::test_spectral_norm_load_state_dict, test/test_nn.py::TestNN::test_spectral_norm_pickle, test/test_nn.py::TestNN::test_state_dict, test/test_nn.py::TestNN::test_swap_module_params_poisons_acc_grad, test/test_nn.py::TestNN::test_sync_batchnorm_accuracy_cuda, test/test_nn.py::TestNN::test_sync_batchnorm_backward_elemt, test/test_nn.py::TestNN::test_threshold_bfloat16_half, test/test_nn.py::TestNN::test_threshold_int, test/test_nn.py::TestNN::test_to, test/test_nn.py::TestNN::test_train_errors_for_invalid_mode, test/test_nn.py::TestNN::test_transformer_args_check, test/test_nn.py::TestNN::test_transformer_layer_args_check, test/test_nn.py::TestNN::test_transformerdecoder, test/test_nn.py::TestNN::test_transformerdecoderlayer, test/test_nn.py::TestNN::test_transformerdecoderlayer_gelu, test/test_nn.py::TestNN::test_triplet_margin_loss, test/test_nn.py::TestNN::test_triplet_margin_loss_no_reduce, test/test_nn.py::TestNN::test_triplet_margin_loss_swap, test/test_nn.py::TestNN::test_triplet_margin_loss_swap_no_reduce, test/test_nn.py::TestNN::test_type, test/test_nn.py::TestNN::test_unflatten, test/test_nn.py::TestNN::test_unflatten_invalid_arg, test/test_nn.py::TestNN::test_unfold_invalid_arg, test/test_nn.py::TestNN::test_upsamplingBilinear2d_spatial_invariance, test/test_nn.py::TestNN::test_upsamplingLinear1d, test/test_nn.py::TestNN::test_upsamplingLinear1d_spatial_invariance, test/test_nn.py::TestNN::test_upsamplingTrilinear3d_spatial_invariance, test/test_nn.py::TestNN::test_upsampling_bfloat16, test/test_nn.py::TestNN::test_upsampling_not_recompute_scale_factor, test/test_nn.py::TestNN::test_upsampling_small_scale, test/test_nn.py::TestNN::test_vector_to_parameters, test/test_nn.py::TestNN::test_weight_norm, test/test_nn.py::TestNN::test_weight_norm_pickle, test/test_nn.py::TestNN::test_weighted_huber_loss, test/test_nn.py::TestNN::test_weighted_l1_loss_with_weights, test/test_nn.py::TestNN::test_weighted_mse_loss, test/test_nn.py::TestNN::test_zero_grad, test/test_nn.py::TestFusionEval::test_fuse_module_eval_numerics, test/test_nn.py::TestConstantPadNd::test_constant_pad_nd, test/test_nn.py::TestConstantPadNd::test_preserves_memory_format, test/test_nn.py::TestAddRelu::test_add_relu, test/test_nn.py::TestAddRelu::test_add_relu_broadcasting, test/test_nn.py::TestFunctionalPickle::test_pickle_softsign, test/test_nn.py::TestFusionUtils::test_fuse_conv_bn_requires_grad, test/test_nn.py::TestFusionUtils::test_fuse_linear_bn_requires_grad, test/test_nn.py::TestUtils::test_consume_prefix_in_state_dict_if_present, test/test_nn.py::TestNNDeviceTypeCUDA::test_BatchNorm_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_Bilinear_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_cudnn_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_empty_target_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_mean_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_mean_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_none_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_none_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_sum_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_sum_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GRU_grad_and_gradgrad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_memory_format_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_numeric_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_raises_error_if_one_value_per_group_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm1d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm2d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm3d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_differentiable_backward_using_oneDNN_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_differentiable_backward_using_oneDNN_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_grad_and_gradgrad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_LayerNorm_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LayerNorm_numeric_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LocalResponseNorm_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_race_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_race_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_warnings_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad2d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad2d_large_deterministic_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad3d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad_empty_cuda_complex64, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad_fails_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad1d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad2d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad3d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad_empty_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerDecoderLayer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerDecoder_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerEncoderLayer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerEncoder_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Transformer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Unfold_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_half_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_half_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_adaptiveavg_pool1d_shmem_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate45_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate90_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotateRandom_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_3d_rotateRandom_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_avg_pool_large_tensor2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_avg_pool_large_tensor_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_large_batch_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_large_batch_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_update_stats_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_channel_shuffle_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_error_if_nonfinite_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_0_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_1_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_4_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_inf_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_0_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_1_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_4_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_inf_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_multi_device_foreach_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_multi_device_foreach_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_value_foreach_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_value_foreach_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_consistent_index_target_and_probs_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_errors_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_weight_ignore_indices_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_with_probs_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_2d_out_of_bounds_class_index_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_2d_out_of_bounds_class_index_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_index_target_unit_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_one_hot_target_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_all_reductions_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_mean_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_mean_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_none_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_none_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_sum_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_sum_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_unit_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_tensor_cpu_length_cuda_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_tensor_cuda_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_error_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cudnn_rnn_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_cudnn_rnn_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_device_mask_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_elu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_elu_inplace_with_neg_alpha_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_fold_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_glu_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_bfloat16_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_half_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_2d_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_2d_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_3d_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_3d_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_nan_inf_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_nan_inf_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardsigmoid_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_grad_corner_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_grad_corner_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_grad_corner_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_for_single_spatial_element_during_training_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_less_than_one_value_per_channel_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_invalid_reduction_strings_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_large_max_pool2d_ch_last_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_large_max_pool_contig_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_large_reflect_pad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_layernorm_half_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_layernorm_weight_bias_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_with_neg_slope_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_with_zero_slope_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_linear_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_big_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_big_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_logsigmoid_out_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_lstmcell_backward_only_one_output_grad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_TxT_layout_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_devices_parity_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_forward_with_nans_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_lowp_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_lowp_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_mask_types_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_transformer_layout_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_mish_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_non_recursive_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_mse_loss_error_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_1d_input_1d_target_invalid_size_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_all_ignored_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_byte_target_matches_long_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_mean_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_none_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_sum_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_invalid_target_dim_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_invalid_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_mismatched_batch_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_out_of_bounds_ignore_index_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_total_weight_is_zero_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_scalars_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_scalars_reductions_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nonlinearity_propagate_nan_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_one_hot_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_overwrite_module_params_on_conversion_cpu_device_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_pad_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_pad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_prelu_backward_32bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_replicatepad_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rmsnorm_epsilon_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rmsnorm_epsilon_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rmsnorm_epsilon_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rmsnorm_epsilon_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_rmsnorm_numeric_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rmsnorm_numeric_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_fused_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_fused_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_rrelu_bounds_validation_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_save_lstm_compatibility_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_silu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_skip_init_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smooth_l1_loss_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smooth_l1_loss_vs_huber_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smoothl1loss_backward_zero_beta_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_backward_64bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_backward_smem_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_backward_unaligned_grad_output_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_backward_unaligned_output_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_backward_without_fully_vectorized_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_bfloat16_half_to_float_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_double_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_forward_64bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_results_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_results_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_softplus_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softplus_low_threshold_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_negative_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_threshold_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_to_complex_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_to_complex_cuda_complex64, test/test_nn.py::TestNNDeviceTypeCUDA::test_to_complex_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_fast_path_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_gelu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_gelu_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_triplet_margin_with_distance_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_triplet_margin_with_distance_loss_default_parity_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_False_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_False_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_True_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_True_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_False_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_False_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_True_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_True_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_aa_correctness_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_aa_correctness_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_correctness_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBilinear2d_aa_correctness_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBilinear2d_aa_correctness_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_correctness_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_correctness_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_fail_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_rocm_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format0_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format0_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format1_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format1_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format0_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format0_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format1_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format1_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_correctness_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_correctness_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_rescale_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_False_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_False_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_True_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_True_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsampling_64bit_indexing_channels_last_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsampling_64bit_indexing_channels_last_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingnearest2d_backward_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_warp_softmax_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_warp_softmax_64bit_indexing_cuda_float32 2025-12-04T11:01:58.1465017Z 2025-12-04T11:01:58.1465158Z Finished test_nn 1/1 ... [2025-12-04 11:01:57.997614][215809.256993578], took 2.62min 2025-12-04T11:01:58.1465784Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_decompose_mem_bound_mm/inductor.test_decompose_mem_bound_mm-e91b2bf8a5c2cbfd.xml 2025-12-04T11:01:58.1466392Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:01:58.1466835Z Running test_binary_ufuncs 1/1 ... [2025-12-04 11:01:58.134010][215809.393385912] 2025-12-04T11:01:58.1467095Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:01:58.1467674Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_binary_ufuncs.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:01:58.134294] 2025-12-04T11:03:56.5919248Z 2025-12-04T11:03:56.5922908Z test_binary_ufuncs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_binary_ufuncs_1.1_c87cf15c11129ea9_.log 2025-12-04T11:03:56.9427106Z Running 12917 items in this shard: test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_add_broadcast_empty_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_add_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_add_with_tail_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_addcmul_scalars_as_floats_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_addsub_half_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_atan2_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_atan2_edgecases_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_binary_op_mem_overlap_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_binary_op_scalar_device_unspecified_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_binary_ops_with_scalars_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bool_tensor_comparison_ops_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cdiv_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cmul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_div_underflow_overflow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_div_underflow_overflow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cpow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cpu_tensor_pow_cuda_scalar_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cremainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cross_device_binary_ops_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cross_device_inplace_error_msg_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_csub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cuda_tensor_pow_scalar_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cumulative_trapezoid_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_and_floordiv_script_vs_python_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_and_floordiv_vs_python_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divmul_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_exceptions_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_scalar_pow_float_tensor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_scalar_pow_float_tensor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_div_extremal_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_div_extremal_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_div_extremal_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_div_extremal_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_int_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_int_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_int_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_int_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_float_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_float_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_float_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_overflow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_overflow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_overflow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_overflow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_overflow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cross_device_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_idiv_and_ifloordiv_vs_python_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_inplace_division_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_inplace_dunders_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_int_and_float_pow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_int_tensor_pow_neg_ints_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_ldexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cpu_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cpu_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_weight_scalar_tensor_promotion_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_weight_scalar_tensor_promotion_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_weight_scalar_tensor_promotion_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_weight_scalar_tensor_promotion_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_weight_tensor_promotion_error_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_weight_tensor_promotion_error_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_weight_tensor_promotion_error_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_with_nontrivial_alignment_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_long_tensor_pow_floats_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_cross_device_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_forward_ad_float32_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_min_max_binary_op_nan_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_min_max_binary_op_nan_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_min_max_binary_op_nan_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_chalf_tensor_and_cpu_scalar_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_intertype_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_intertype_scalar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_intertype_scalar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_nextafter_bfloat16_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_out_resize_warning_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex_extremal_passing_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex_extremal_passing_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_inplace_resizing_exception_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_scalar_base_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_scalar_overloads_mem_overlap_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_scalar_type_promotion_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_remainder_fmod_large_dividend_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_remainder_fmod_large_dividend_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_remainder_overflow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rpow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_typing_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_tensor_pow_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_trapezoid_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_true_divide_out_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_true_divide_out_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___radd___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rand___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rdiv___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rmod___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rmul___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___ror___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rpow___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rsub___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rxor___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs__conversions_complex_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs__conversions_polar_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_add_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_atan2_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_left_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_right_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_clamp_max_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_clamp_min_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_copysign_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_div_floor_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_div_no_rounding_mode_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_div_trunc_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_eq_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_float_power_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_floor_divide_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_fmax_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_fmin_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_fmod_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_gcd_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_ge_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_gt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_heaviside_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_hypot_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_igamma_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_igammac_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_isclose_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_lcm_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_le_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logaddexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logical_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logical_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logical_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_lt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_maximum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_minimum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_mul_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_ne_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_nextafter_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_pow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_remainder_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_rsub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_special_xlog1py_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_special_zeta_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_sub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_true_divide_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_xlogy_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_add_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_atan2_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_left_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_right_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_clamp_max_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_clamp_min_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_complex_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_copysign_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_div_floor_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_div_no_rounding_mode_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_div_trunc_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_eq_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_float_power_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_floor_divide_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_fmax_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_fmin_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_fmod_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_gcd_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_ge_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_gt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_heaviside_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_hypot_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_igamma_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_igammac_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_isclose_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_jiterator_binary_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_jiterator_binary_return_by_ref_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_lcm_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_ldexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_le_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logaddexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logical_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logical_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logical_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_lt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_max_binary_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_maximum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_min_binary_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_minimum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_mul_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_ne_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_nextafter_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_polar_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_pow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_remainder_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_rsub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_t_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_u_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_v_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_w_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_hermite_polynomial_h_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_hermite_polynomial_he_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_laguerre_polynomial_l_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_legendre_polynomial_p_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_t_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_u_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_v_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_w_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_xlog1py_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_zeta_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_sub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_true_divide_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_xlogy_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_bfloat16_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_gradients_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_scalar_type_promotion_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_uint8 2025-12-04T11:03:57.2166749Z 2025-12-04T11:03:57.2166926Z Finished test_binary_ufuncs 1/1 ... [2025-12-04 11:03:56.603582][215927.862961226], took 1.97min 2025-12-04T11:03:57.2167575Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_decompose_mem_bound_mm/inductor.test_decompose_mem_bound_mm-e91b2bf8a5c2cbfd.xml 2025-12-04T11:03:57.2168198Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:03:57.2168524Z Running test_unary_ufuncs 1/1 ... [2025-12-04 11:03:56.737945][215927.997321299] 2025-12-04T11:03:57.2168786Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:03:57.2169535Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_unary_ufuncs.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:03:56.738223] 2025-12-04T11:07:37.6465998Z 2025-12-04T11:07:37.6466708Z test_unary_ufuncs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_unary_ufuncs_1.1_24c7d5d8d70eb20a_.log 2025-12-04T11:07:38.2159870Z Running 25078 items in this shard: test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_angle_complex_to_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_angle_complex_to_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_big_number_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_signed_zero_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_signed_zero_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_zero_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_zero_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_complex_edge_values_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_special_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_special_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_slow_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_assert_raises_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_common_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_common_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_common_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_common_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_log1p_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_log1p_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mvlgamma_inplace_integer_error_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mvlgamma_inplace_integer_error_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mvlgamma_integer_promotion_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mvlgamma_integer_promotion_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_bfloat16_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_narrow_dtypes_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_narrow_dtypes_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nonzero_empty_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nonzero_large_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nonzero_static_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nonzero_static_large_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_op_invert_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_polygamma_neg_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_log_ndtr_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_log_ndtr_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_ndtr_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_ndtr_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_tan_complex_cuda_matches_numpy_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_tanh_complex_cuda_matches_numpy_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_unary_out_op_mem_overlap_cuda_float64 2025-12-04T11:07:38.7626613Z 2025-12-04T11:07:38.7626804Z Finished test_unary_ufuncs 1/1 ... [2025-12-04 11:07:37.679452][216148.938824462], took 3.68min 2025-12-04T11:07:38.7627457Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_decompose_mem_bound_mm/inductor.test_decompose_mem_bound_mm-e91b2bf8a5c2cbfd.xml 2025-12-04T11:07:38.7628215Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:07:40.5885824Z Running test batch 'tests to run' cost 5493.36 seconds 2025-12-04T11:07:40.5886694Z Emitting td_test_failure_stats_v2 2025-12-04T11:07:40.5897126Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764846460_732b761cd10111f083e29e4e45b48bac 2025-12-04T11:07:42.6133294Z /var/lib/jenkins/pytorch/tools/stats/upload_metrics.py:156: UserWarning: Error uploading metric td_test_failure_stats_v2 to DynamoDB: Unable to locate credentials 2025-12-04T11:07:42.6134815Z warn(f"Error uploading metric {metric_name} to DynamoDB: {e}") 2025-12-04T11:07:42.6135444Z inductor/test_torchinductor 2/2 failed! 2025-12-04T11:07:43.1549439Z 2025-12-04T11:07:43.1549924Z real 91m38.972s 2025-12-04T11:07:43.1550317Z user 137m28.681s 2025-12-04T11:07:43.1550687Z sys 22m13.750s 2025-12-04T11:07:43.1551033Z + sccache_epilogue 2025-12-04T11:07:43.1551458Z + echo '::group::Sccache Compilation Log' 2025-12-04T11:07:43.1552364Z ##[group]Sccache Compilation Log 2025-12-04T11:07:43.1552896Z + echo '=================== sccache compilation log ===================' 2025-12-04T11:07:43.1553496Z =================== sccache compilation log =================== 2025-12-04T11:07:43.1554298Z + python /var/lib/jenkins/pytorch/.ci/pytorch/print_sccache_log.py /var/lib/jenkins/sccache_error.log 2025-12-04T11:07:43.1651479Z + echo '=========== If your build fails, please take a look at the log above for possible reasons ===========' 2025-12-04T11:07:43.1652584Z =========== If your build fails, please take a look at the log above for possible reasons =========== 2025-12-04T11:07:43.1653303Z + sccache --show-stats 2025-12-04T11:07:43.1670387Z Compile requests 1590 2025-12-04T11:07:43.1670943Z Compile requests executed 562 2025-12-04T11:07:43.1671245Z Cache hits 34 2025-12-04T11:07:43.1671497Z Cache hits (C/C++) 34 2025-12-04T11:07:43.1671739Z Cache misses 528 2025-12-04T11:07:43.1671976Z Cache misses (C/C++) 522 2025-12-04T11:07:43.1672215Z Cache misses (HIP) 6 2025-12-04T11:07:43.1672468Z Cache hits rate 6.05 % 2025-12-04T11:07:43.1672737Z Cache hits rate (C/C++) 6.12 % 2025-12-04T11:07:43.1672982Z Cache hits rate (HIP) 0.00 % 2025-12-04T11:07:43.1673217Z Cache timeouts 0 2025-12-04T11:07:43.1673446Z Cache read errors 0 2025-12-04T11:07:43.1673671Z Forced recaches 0 2025-12-04T11:07:43.1673962Z Cache write errors 0 2025-12-04T11:07:43.1674399Z Cache errors 0 2025-12-04T11:07:43.1674822Z Compilations 528 2025-12-04T11:07:43.1675243Z Compilation failures 0 2025-12-04T11:07:43.1675681Z Non-cacheable compilations 0 2025-12-04T11:07:43.1676122Z Non-cacheable calls 280 2025-12-04T11:07:43.1676566Z Non-compilation calls 748 2025-12-04T11:07:43.1677010Z Unsupported compiler calls 0 2025-12-04T11:07:43.1677454Z Average cache write 0.000 s 2025-12-04T11:07:43.1677908Z Average compiler 2.231 s 2025-12-04T11:07:43.1678355Z Average cache read hit 0.000 s 2025-12-04T11:07:43.1678813Z Failed distributed compilations 0 2025-12-04T11:07:43.1679117Z 2025-12-04T11:07:43.1679271Z Non-cacheable reasons: 2025-12-04T11:07:43.1679654Z unknown source language 255 2025-12-04T11:07:43.1680473Z -E 25 2025-12-04T11:07:43.1680933Z 2025-12-04T11:07:43.1681232Z Cache location Local disk: "/var/lib/jenkins/.cache/sccache" 2025-12-04T11:07:43.1681850Z Use direct/preprocessor mode? yes 2025-12-04T11:07:43.1682307Z Version (client) 0.10.0 2025-12-04T11:07:43.1682748Z Cache size 34 MiB 2025-12-04T11:07:43.1683202Z Max cache size 10 GiB 2025-12-04T11:07:43.1683818Z + sccache --stop-server 2025-12-04T11:07:43.1687169Z Stopping sccache server... 2025-12-04T11:07:43.1689215Z Compile requests 1590 2025-12-04T11:07:43.1689728Z Compile requests executed 562 2025-12-04T11:07:43.1690166Z Cache hits 34 2025-12-04T11:07:43.1690598Z Cache hits (C/C++) 34 2025-12-04T11:07:43.1691024Z Cache misses 528 2025-12-04T11:07:43.1691439Z Cache misses (C/C++) 522 2025-12-04T11:07:43.1691848Z Cache misses (HIP) 6 2025-12-04T11:07:43.1692285Z Cache hits rate 6.05 % 2025-12-04T11:07:43.1692741Z Cache hits rate (C/C++) 6.12 % 2025-12-04T11:07:43.1693298Z Cache hits rate (HIP) 0.00 % 2025-12-04T11:07:43.1693863Z Cache timeouts 0 2025-12-04T11:07:43.1694296Z Cache read errors 0 2025-12-04T11:07:43.1694647Z Forced recaches 0 2025-12-04T11:07:43.1694907Z Cache write errors 0 2025-12-04T11:07:43.1695228Z Cache errors 0 2025-12-04T11:07:43.1695594Z Compilations 528 2025-12-04T11:07:43.1695987Z Compilation failures 0 2025-12-04T11:07:43.1696483Z Non-cacheable compilations 0 2025-12-04T11:07:43.1696796Z Non-cacheable calls 280 2025-12-04T11:07:43.1708178Z Non-compilation calls 748 2025-12-04T11:07:43.1708583Z Unsupported compiler calls 0 2025-12-04T11:07:43.1708918Z Average cache write 0.000 s 2025-12-04T11:07:43.1709281Z Average compiler 2.231 s 2025-12-04T11:07:43.1709659Z Average cache read hit 0.000 s 2025-12-04T11:07:43.1709917Z Failed distributed compilations 0 2025-12-04T11:07:43.1710080Z 2025-12-04T11:07:43.1710170Z Non-cacheable reasons: 2025-12-04T11:07:43.1710381Z unknown source language 255 2025-12-04T11:07:43.1710618Z -E 25 2025-12-04T11:07:43.1710773Z 2025-12-04T11:07:43.1710934Z Cache location Local disk: "/var/lib/jenkins/.cache/sccache" 2025-12-04T11:07:43.1711270Z Use direct/preprocessor mode? yes 2025-12-04T11:07:43.1711615Z Version (client) 0.10.0 2025-12-04T11:07:43.1711910Z Cache size 34 MiB 2025-12-04T11:07:43.1712187Z Max cache size 10 GiB 2025-12-04T11:07:43.1712443Z + echo ::endgroup:: 2025-12-04T11:07:43.1712804Z ##[endgroup] 2025-12-04T11:07:43.1747951Z ##[error]Process completed with exit code 1. 2025-12-04T11:07:43.1783263Z ##[group]Run # copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2025-12-04T11:07:43.1783818Z # copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2025-12-04T11:07:43.1784453Z docker exec -t "22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752" sh -c "cd ../pytorch && sudo cp -R test/test-reports ../workspace/test" 2025-12-04T11:07:43.1796322Z shell: /usr/bin/bash -e {0} 2025-12-04T11:07:43.1796510Z env: 2025-12-04T11:07:43.1796659Z GIT_DEFAULT_BRANCH: main 2025-12-04T11:07:43.1796927Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T11:07:43.1797310Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T11:07:43.1797663Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T11:07:43.1798291Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T11:07:43.1798936Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T11:07:43.1799120Z AWS_REGION: us-east-1 2025-12-04T11:07:43.1799337Z AWS_ACCESS_KEY_ID: *** 2025-12-04T11:07:43.1799578Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T11:07:43.1803018Z AWS_SESSION_TOKEN: *** 2025-12-04T11:07:43.1803298Z CONTAINER_NAME: 22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 2025-12-04T11:07:43.1803595Z ##[endgroup] 2025-12-04T11:07:43.2523107Z ##[group]Run cat test/**/*_toprint.log || true 2025-12-04T11:07:43.2523672Z cat test/**/*_toprint.log || true 2025-12-04T11:07:43.2547849Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T11:07:43.2548394Z env: 2025-12-04T11:07:43.2548707Z GIT_DEFAULT_BRANCH: main 2025-12-04T11:07:43.2549283Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T11:07:43.2550106Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T11:07:43.2550887Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T11:07:43.2552219Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T11:07:43.2553422Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T11:07:43.2553816Z AWS_REGION: us-east-1 2025-12-04T11:07:43.2554302Z AWS_ACCESS_KEY_ID: *** 2025-12-04T11:07:43.2554823Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T11:07:43.2562604Z AWS_SESSION_TOKEN: *** 2025-12-04T11:07:43.2563402Z CONTAINER_NAME: 22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 2025-12-04T11:07:43.2564053Z ##[endgroup] 2025-12-04T11:07:43.2686413Z cat: 'test/**/*_toprint.log': No such file or directory 2025-12-04T11:07:43.2768080Z Prepare all required actions 2025-12-04T11:07:43.2768485Z Getting action download info 2025-12-04T11:07:43.5533446Z Download action repository 'seemethere/upload-artifact-s3@v5' (SHA:baba72d0712b404f646cebe0730933554ebce96a) 2025-12-04T11:07:44.0719173Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-12-04T11:07:44.7277202Z ##[group]Run ./.github/actions/upload-test-artifacts 2025-12-04T11:07:44.7277425Z with: 2025-12-04T11:07:44.7277566Z use-gha: true 2025-12-04T11:07:44.7277784Z file-suffix: test-default-2-2-linux.rocm.gpu.gfx1100_57118756853 2025-12-04T11:07:44.7278044Z s3-bucket: gha-artifacts 2025-12-04T11:07:44.7278201Z env: 2025-12-04T11:07:44.7278332Z GIT_DEFAULT_BRANCH: main 2025-12-04T11:07:44.7278583Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T11:07:44.7278933Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T11:07:44.7279279Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T11:07:44.7279853Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T11:07:44.7280372Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T11:07:44.7280544Z AWS_REGION: us-east-1 2025-12-04T11:07:44.7280735Z AWS_ACCESS_KEY_ID: *** 2025-12-04T11:07:44.7280963Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T11:07:44.7284198Z AWS_SESSION_TOKEN: *** 2025-12-04T11:07:44.7284454Z CONTAINER_NAME: 22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 2025-12-04T11:07:44.7284729Z ##[endgroup] 2025-12-04T11:07:44.7333219Z ##[group]Run actions/upload-artifact@v4 2025-12-04T11:07:44.7333433Z with: 2025-12-04T11:07:44.7333726Z name: test-jsons-runattempt1-test-default-2-2-linux.rocm.gpu.gfx1100_57118756853.zip 2025-12-04T11:07:44.7334065Z retention-days: 14 2025-12-04T11:07:44.7334232Z if-no-files-found: warn 2025-12-04T11:07:44.7334412Z path: test/**/*.json 2025-12-04T11:07:44.7334683Z compression-level: 6 2025-12-04T11:07:44.7334844Z overwrite: false 2025-12-04T11:07:44.7335004Z include-hidden-files: false 2025-12-04T11:07:44.7335180Z env: 2025-12-04T11:07:44.7335322Z GIT_DEFAULT_BRANCH: main 2025-12-04T11:07:44.7335593Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T11:07:44.7335982Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T11:07:44.7336345Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T11:07:44.7337000Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T11:07:44.7337590Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T11:07:44.7337773Z AWS_REGION: us-east-1 2025-12-04T11:07:44.7337974Z AWS_ACCESS_KEY_ID: *** 2025-12-04T11:07:44.7338213Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T11:07:44.7341936Z AWS_SESSION_TOKEN: *** 2025-12-04T11:07:44.7342219Z CONTAINER_NAME: 22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 2025-12-04T11:07:44.7342521Z ##[endgroup] 2025-12-04T11:07:45.2489664Z With the provided path, there will be 6 files uploaded 2025-12-04T11:07:45.2494221Z Artifact name is valid! 2025-12-04T11:07:45.2494993Z Root directory input is valid! 2025-12-04T11:07:46.6532411Z Beginning upload of artifact content to blob storage 2025-12-04T11:07:46.8499017Z Uploaded bytes 42780 2025-12-04T11:07:46.8875836Z Finished uploading artifact content to blob storage! 2025-12-04T11:07:46.8876876Z SHA256 digest of uploaded artifact zip is 5651599beaea6fc700e22796d445bad59515175e23bd57dfb65a2f970f3471be 2025-12-04T11:07:46.8878291Z Finalizing artifact upload 2025-12-04T11:07:47.0124104Z Artifact test-jsons-runattempt1-test-default-2-2-linux.rocm.gpu.gfx1100_57118756853.zip.zip successfully finalized. Artifact ID 4763041970 2025-12-04T11:07:47.0125992Z Artifact test-jsons-runattempt1-test-default-2-2-linux.rocm.gpu.gfx1100_57118756853.zip has been successfully uploaded! Final size is 42780 bytes. Artifact ID is 4763041970 2025-12-04T11:07:47.0130689Z Artifact download URL: https://github.com/pytorch/pytorch/actions/runs/19923123999/artifacts/4763041970 2025-12-04T11:07:47.0294054Z ##[group]Run actions/upload-artifact@v4 2025-12-04T11:07:47.0294546Z with: 2025-12-04T11:07:47.0295217Z name: test-reports-runattempt1-test-default-2-2-linux.rocm.gpu.gfx1100_57118756853.zip 2025-12-04T11:07:47.0295999Z retention-days: 14 2025-12-04T11:07:47.0296398Z if-no-files-found: ignore 2025-12-04T11:07:47.0296838Z path: test/**/*.xml test/**/*.csv 2025-12-04T11:07:47.0297326Z compression-level: 6 2025-12-04T11:07:47.0297710Z overwrite: false 2025-12-04T11:07:47.0298106Z include-hidden-files: false 2025-12-04T11:07:47.0298512Z env: 2025-12-04T11:07:47.0298860Z GIT_DEFAULT_BRANCH: main 2025-12-04T11:07:47.0299471Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T11:07:47.0300366Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T11:07:47.0301194Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T11:07:47.0302628Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T11:07:47.0303939Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T11:07:47.0304377Z AWS_REGION: us-east-1 2025-12-04T11:07:47.0304869Z AWS_ACCESS_KEY_ID: *** 2025-12-04T11:07:47.0305434Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T11:07:47.0313640Z AWS_SESSION_TOKEN: *** 2025-12-04T11:07:47.0314284Z CONTAINER_NAME: 22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 2025-12-04T11:07:47.0314977Z ##[endgroup] 2025-12-04T11:07:47.6229956Z With the provided path, there will be 22 files uploaded 2025-12-04T11:07:47.6234429Z Artifact name is valid! 2025-12-04T11:07:47.6235294Z Root directory input is valid! 2025-12-04T11:07:49.0509771Z Beginning upload of artifact content to blob storage 2025-12-04T11:07:49.6645091Z Uploaded bytes 1333125 2025-12-04T11:07:49.7047574Z Finished uploading artifact content to blob storage! 2025-12-04T11:07:49.7052690Z SHA256 digest of uploaded artifact zip is b27085381a229034abf77b0d7661c8e42f6027dca1a1ab4fc58b0de882c703ee 2025-12-04T11:07:49.7055096Z Finalizing artifact upload 2025-12-04T11:07:49.8225322Z Artifact test-reports-runattempt1-test-default-2-2-linux.rocm.gpu.gfx1100_57118756853.zip.zip successfully finalized. Artifact ID 4763042463 2025-12-04T11:07:49.8227298Z Artifact test-reports-runattempt1-test-default-2-2-linux.rocm.gpu.gfx1100_57118756853.zip has been successfully uploaded! Final size is 1333125 bytes. Artifact ID is 4763042463 2025-12-04T11:07:49.8234729Z Artifact download URL: https://github.com/pytorch/pytorch/actions/runs/19923123999/artifacts/4763042463 2025-12-04T11:07:49.8526960Z ##[group]Run actions/upload-artifact@v4 2025-12-04T11:07:49.8527472Z with: 2025-12-04T11:07:49.8528096Z name: logs-runattempt1-test-default-2-2-linux.rocm.gpu.gfx1100_57118756853.zip 2025-12-04T11:07:49.8528822Z retention-days: 14 2025-12-04T11:07:49.8529230Z if-no-files-found: ignore 2025-12-04T11:07:49.8529694Z path: usage_log.txt test/**/*.log 2025-12-04T11:07:49.8530167Z compression-level: 6 2025-12-04T11:07:49.8530558Z overwrite: false 2025-12-04T11:07:49.8530945Z include-hidden-files: false 2025-12-04T11:07:49.8531373Z env: 2025-12-04T11:07:49.8531706Z GIT_DEFAULT_BRANCH: main 2025-12-04T11:07:49.8532316Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T11:07:49.8533407Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T11:07:49.8534260Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T11:07:49.8536081Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T11:07:49.8537413Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T11:07:49.8537859Z AWS_REGION: us-east-1 2025-12-04T11:07:49.8538352Z AWS_ACCESS_KEY_ID: *** 2025-12-04T11:07:49.8538933Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T11:07:49.8547194Z AWS_SESSION_TOKEN: *** 2025-12-04T11:07:49.8547863Z CONTAINER_NAME: 22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 2025-12-04T11:07:49.8548572Z ##[endgroup] 2025-12-04T11:07:50.4661040Z Multiple search paths detected. Calculating the least common ancestor of all paths 2025-12-04T11:07:50.4662833Z The least common ancestor is /home/pytorchci/actions-runner/_work/pytorch/pytorch. This will be the root directory of the artifact 2025-12-04T11:07:50.4663415Z With the provided path, there will be 16 files uploaded 2025-12-04T11:07:50.4666717Z Artifact name is valid! 2025-12-04T11:07:50.4667156Z Root directory input is valid! 2025-12-04T11:07:51.8933194Z Beginning upload of artifact content to blob storage 2025-12-04T11:07:52.3843650Z Uploaded bytes 656369 2025-12-04T11:07:52.4267588Z Finished uploading artifact content to blob storage! 2025-12-04T11:07:52.4272835Z SHA256 digest of uploaded artifact zip is 6100a5781b2cacc107953ba48cd61261f0f484c3eb876e9a4c2b1cee4b40b4ae 2025-12-04T11:07:52.4275390Z Finalizing artifact upload 2025-12-04T11:07:52.5784149Z Artifact logs-runattempt1-test-default-2-2-linux.rocm.gpu.gfx1100_57118756853.zip.zip successfully finalized. Artifact ID 4763042959 2025-12-04T11:07:52.5786179Z Artifact logs-runattempt1-test-default-2-2-linux.rocm.gpu.gfx1100_57118756853.zip has been successfully uploaded! Final size is 656369 bytes. Artifact ID is 4763042959 2025-12-04T11:07:52.5787896Z Artifact download URL: https://github.com/pytorch/pytorch/actions/runs/19923123999/artifacts/4763042959 2025-12-04T11:07:52.6042870Z ##[group]Run # shellcheck disable=SC2156 2025-12-04T11:07:52.6043500Z # shellcheck disable=SC2156 2025-12-04T11:07:52.6044585Z find . -iname "core.[1-9]*" -exec docker exec "${CONTAINER_NAME}" sh -c "gdb python {} -ex 'bt' -ex 'q'" \; 2025-12-04T11:07:52.6070490Z shell: /usr/bin/bash -e {0} 2025-12-04T11:07:52.6070961Z env: 2025-12-04T11:07:52.6071323Z GIT_DEFAULT_BRANCH: main 2025-12-04T11:07:52.6071984Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T11:07:52.6072917Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T11:07:52.6073793Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T11:07:52.6075262Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T11:07:52.6076663Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T11:07:52.6077125Z AWS_REGION: us-east-1 2025-12-04T11:07:52.6077639Z AWS_ACCESS_KEY_ID: *** 2025-12-04T11:07:52.6078232Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T11:07:52.6086589Z AWS_SESSION_TOKEN: *** 2025-12-04T11:07:52.6087254Z CONTAINER_NAME: 22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 2025-12-04T11:07:52.6087973Z ##[endgroup] 2025-12-04T11:07:52.8930016Z ##[group]Run actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02 2025-12-04T11:07:52.8930752Z with: 2025-12-04T11:07:52.8931227Z name: coredumps-default-2-2-linux.rocm.gpu.gfx1100 2025-12-04T11:07:52.8931797Z retention-days: 14 2025-12-04T11:07:52.8932201Z if-no-files-found: ignore 2025-12-04T11:07:52.8932649Z path: ./**/core.[1-9]* 2025-12-04T11:07:52.8933066Z compression-level: 6 2025-12-04T11:07:52.8933700Z overwrite: false 2025-12-04T11:07:52.8934094Z include-hidden-files: false 2025-12-04T11:07:52.8934519Z env: 2025-12-04T11:07:52.8934870Z GIT_DEFAULT_BRANCH: main 2025-12-04T11:07:52.8935501Z RUNNER_ARTIFACT_DIR: /home/pytorchci/actions-runner/_work/_temp/artifacts 2025-12-04T11:07:52.8936414Z RUNNER_TEST_RESULTS_DIR: /home/pytorchci/actions-runner/_work/_temp/test-results 2025-12-04T11:07:52.8937273Z RUNNER_DOCS_DIR: /home/pytorchci/actions-runner/_work/_temp/docs 2025-12-04T11:07:52.8938768Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device /dev/dri --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T11:07:52.8940077Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T11:07:52.8940537Z AWS_REGION: us-east-1 2025-12-04T11:07:52.8941056Z AWS_ACCESS_KEY_ID: *** 2025-12-04T11:07:52.8941655Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T11:07:52.8949939Z AWS_SESSION_TOKEN: *** 2025-12-04T11:07:52.8950628Z CONTAINER_NAME: 22fb02850636b6e80610c15f11fede09951d580736da76fb3ce4c86b6fc77752 2025-12-04T11:07:52.8951355Z ##[endgroup] 2025-12-04T11:07:58.2987521Z No files were found with the provided path: ./**/core.[1-9]*. No artifacts will be uploaded. 2025-12-04T11:07:58.3276355Z Post job cleanup. 2025-12-04T11:07:58.3323547Z Post job cleanup. 2025-12-04T11:07:58.3619175Z Logging out of registry 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T11:07:58.3923651Z Post job cleanup. 2025-12-04T11:07:58.4933479Z Post job cleanup. 2025-12-04T11:07:58.5036150Z Post job cleanup. 2025-12-04T11:07:58.5749063Z [command]/usr/bin/git version 2025-12-04T11:07:58.5777984Z git version 2.34.1 2025-12-04T11:07:58.5804065Z Copying '/home/pytorchci/.gitconfig' to '/home/pytorchci/actions-runner/_work/_temp/62c51fb5-6968-4ca1-812b-8b8e732e7231/.gitconfig' 2025-12-04T11:07:58.5812862Z Temporarily overriding HOME='/home/pytorchci/actions-runner/_work/_temp/62c51fb5-6968-4ca1-812b-8b8e732e7231' before making global git config changes 2025-12-04T11:07:58.5813453Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T11:07:58.5816164Z [command]/usr/bin/git config --global --add safe.directory /home/pytorchci/actions-runner/_work/pytorch/pytorch 2025-12-04T11:07:58.5839468Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T11:07:58.5868795Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T11:07:58.6149783Z Entering 'android/libs/fbjni' 2025-12-04T11:07:58.6201164Z Entering 'third_party/FP16' 2025-12-04T11:07:58.6251646Z Entering 'third_party/FXdiv' 2025-12-04T11:07:58.6290772Z Entering 'third_party/NNPACK' 2025-12-04T11:07:58.6326607Z Entering 'third_party/NVTX' 2025-12-04T11:07:58.6371678Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T11:07:58.6405919Z Entering 'third_party/XNNPACK' 2025-12-04T11:07:58.6459113Z Entering 'third_party/aiter' 2025-12-04T11:07:58.6497303Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T11:07:58.6555049Z Entering 'third_party/benchmark' 2025-12-04T11:07:58.6603273Z Entering 'third_party/composable_kernel' 2025-12-04T11:07:58.6659085Z Entering 'third_party/cpp-httplib' 2025-12-04T11:07:58.6698917Z Entering 'third_party/cpuinfo' 2025-12-04T11:07:58.6743293Z Entering 'third_party/cudnn_frontend' 2025-12-04T11:07:58.6786176Z Entering 'third_party/cutlass' 2025-12-04T11:07:58.6834886Z Entering 'third_party/fbgemm' 2025-12-04T11:07:58.6889799Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T11:07:58.6934354Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T11:07:58.6984953Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T11:07:58.7027385Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T11:07:58.7076385Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T11:07:58.7118312Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T11:07:58.7164638Z Entering 'third_party/fbgemm/external/json' 2025-12-04T11:07:58.7216802Z Entering 'third_party/flash-attention' 2025-12-04T11:07:58.7258621Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T11:07:58.7310422Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T11:07:58.7359436Z Entering 'third_party/flatbuffers' 2025-12-04T11:07:58.7401433Z Entering 'third_party/fmt' 2025-12-04T11:07:58.7449875Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T11:07:58.7505770Z Entering 'third_party/gloo' 2025-12-04T11:07:58.7556530Z Entering 'third_party/googletest' 2025-12-04T11:07:58.7610610Z Entering 'third_party/ideep' 2025-12-04T11:07:58.7657151Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T11:07:58.7712031Z Entering 'third_party/ittapi' 2025-12-04T11:07:58.7749137Z Entering 'third_party/kineto' 2025-12-04T11:07:58.7789456Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T11:07:58.7836054Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T11:07:58.7878783Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T11:07:58.7914444Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T11:07:58.7956563Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T11:07:58.7988849Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T11:07:58.8057615Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T11:07:58.8105392Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T11:07:58.8158675Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T11:07:58.8207332Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T11:07:58.8256923Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T11:07:58.8285768Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T11:07:58.8349206Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T11:07:58.8405858Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T11:07:58.8442849Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T11:07:58.8483634Z Entering 'third_party/kleidiai' 2025-12-04T11:07:58.8534326Z Entering 'third_party/mimalloc' 2025-12-04T11:07:58.8580740Z Entering 'third_party/nlohmann' 2025-12-04T11:07:58.8628702Z Entering 'third_party/onnx' 2025-12-04T11:07:58.8691998Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T11:07:58.8743825Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T11:07:58.8780368Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T11:07:58.8823932Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T11:07:58.8869075Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T11:07:58.8909985Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T11:07:58.8949462Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T11:07:58.8988932Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T11:07:58.9024152Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T11:07:58.9056095Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T11:07:58.9099507Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T11:07:58.9147275Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T11:07:58.9208826Z Entering 'third_party/pocketfft' 2025-12-04T11:07:58.9260688Z Entering 'third_party/protobuf' 2025-12-04T11:07:58.9306504Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T11:07:58.9345927Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T11:07:58.9384282Z Entering 'third_party/psimd' 2025-12-04T11:07:58.9434900Z Entering 'third_party/pthreadpool' 2025-12-04T11:07:58.9480656Z Entering 'third_party/pybind11' 2025-12-04T11:07:58.9524738Z Entering 'third_party/python-peachpy' 2025-12-04T11:07:58.9564124Z Entering 'third_party/sleef' 2025-12-04T11:07:58.9604768Z Entering 'third_party/tensorpipe' 2025-12-04T11:07:58.9643777Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T11:07:58.9692053Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T11:07:58.9738921Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T11:07:58.9775717Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T11:07:58.9816658Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T11:07:58.9890030Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T11:07:58.9920457Z http.https://github.com/.extraheader 2025-12-04T11:07:58.9936562Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-12-04T11:07:58.9975769Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T11:07:59.0241774Z Entering 'android/libs/fbjni' 2025-12-04T11:07:59.0273704Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0311551Z Entering 'third_party/FP16' 2025-12-04T11:07:59.0339399Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0375671Z Entering 'third_party/FXdiv' 2025-12-04T11:07:59.0404902Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0440073Z Entering 'third_party/NNPACK' 2025-12-04T11:07:59.0462140Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0501113Z Entering 'third_party/NVTX' 2025-12-04T11:07:59.0529539Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0561438Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T11:07:59.0582730Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0622063Z Entering 'third_party/XNNPACK' 2025-12-04T11:07:59.0656371Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0700145Z Entering 'third_party/aiter' 2025-12-04T11:07:59.0729837Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0758383Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T11:07:59.0781992Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0833563Z Entering 'third_party/benchmark' 2025-12-04T11:07:59.0862488Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0897972Z Entering 'third_party/composable_kernel' 2025-12-04T11:07:59.0929750Z http.https://github.com/.extraheader 2025-12-04T11:07:59.0967805Z Entering 'third_party/cpp-httplib' 2025-12-04T11:07:59.1001568Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1034441Z Entering 'third_party/cpuinfo' 2025-12-04T11:07:59.1058765Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1094970Z Entering 'third_party/cudnn_frontend' 2025-12-04T11:07:59.1130074Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1165570Z Entering 'third_party/cutlass' 2025-12-04T11:07:59.1193858Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1233761Z Entering 'third_party/fbgemm' 2025-12-04T11:07:59.1263222Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1301080Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T11:07:59.1331967Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1370555Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T11:07:59.1392105Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1433338Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T11:07:59.1462603Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1500085Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T11:07:59.1522143Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1571880Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T11:07:59.1598749Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1636746Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T11:07:59.1663304Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1698446Z Entering 'third_party/fbgemm/external/json' 2025-12-04T11:07:59.1731351Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1772175Z Entering 'third_party/flash-attention' 2025-12-04T11:07:59.1796267Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1825101Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T11:07:59.1852557Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1884498Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T11:07:59.1910785Z http.https://github.com/.extraheader 2025-12-04T11:07:59.1979379Z Entering 'third_party/flatbuffers' 2025-12-04T11:07:59.2002855Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2049597Z Entering 'third_party/fmt' 2025-12-04T11:07:59.2082381Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2117211Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T11:07:59.2142056Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2176875Z Entering 'third_party/gloo' 2025-12-04T11:07:59.2209881Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2251486Z Entering 'third_party/googletest' 2025-12-04T11:07:59.2276790Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2312434Z Entering 'third_party/ideep' 2025-12-04T11:07:59.2341916Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2376517Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T11:07:59.2404794Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2455594Z Entering 'third_party/ittapi' 2025-12-04T11:07:59.2488201Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2528520Z Entering 'third_party/kineto' 2025-12-04T11:07:59.2561327Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2598521Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T11:07:59.2624912Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2673586Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T11:07:59.2690693Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2725811Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T11:07:59.2757931Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2799911Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T11:07:59.2833211Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2873518Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T11:07:59.2894966Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2930335Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T11:07:59.2954748Z http.https://github.com/.extraheader 2025-12-04T11:07:59.2991621Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T11:07:59.3012438Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3045763Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T11:07:59.3069851Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3098442Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T11:07:59.3123155Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3159161Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T11:07:59.3181301Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3217746Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T11:07:59.3238928Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3276269Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T11:07:59.3309110Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3346369Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T11:07:59.3365937Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3405006Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T11:07:59.3433249Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3476247Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T11:07:59.3495385Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3531618Z Entering 'third_party/kleidiai' 2025-12-04T11:07:59.3561566Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3599898Z Entering 'third_party/mimalloc' 2025-12-04T11:07:59.3627908Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3663509Z Entering 'third_party/nlohmann' 2025-12-04T11:07:59.3685943Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3718161Z Entering 'third_party/onnx' 2025-12-04T11:07:59.3749335Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3817174Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T11:07:59.3848940Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3892013Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T11:07:59.3917915Z http.https://github.com/.extraheader 2025-12-04T11:07:59.3953534Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T11:07:59.3978808Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4013804Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T11:07:59.4038288Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4082396Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T11:07:59.4108377Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4150725Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T11:07:59.4173763Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4211290Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T11:07:59.4241279Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4280428Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T11:07:59.4307206Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4340314Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T11:07:59.4369514Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4409221Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T11:07:59.4437197Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4471709Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T11:07:59.4497880Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4536805Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T11:07:59.4559154Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4612924Z Entering 'third_party/pocketfft' 2025-12-04T11:07:59.4644813Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4681280Z Entering 'third_party/protobuf' 2025-12-04T11:07:59.4708320Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4744436Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T11:07:59.4776005Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4812549Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T11:07:59.4834995Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4876394Z Entering 'third_party/psimd' 2025-12-04T11:07:59.4906600Z http.https://github.com/.extraheader 2025-12-04T11:07:59.4942834Z Entering 'third_party/pthreadpool' 2025-12-04T11:07:59.4973536Z http.https://github.com/.extraheader 2025-12-04T11:07:59.5011242Z Entering 'third_party/pybind11' 2025-12-04T11:07:59.5031018Z http.https://github.com/.extraheader 2025-12-04T11:07:59.5067338Z Entering 'third_party/python-peachpy' 2025-12-04T11:07:59.5089742Z http.https://github.com/.extraheader 2025-12-04T11:07:59.5134850Z Entering 'third_party/sleef' 2025-12-04T11:07:59.5164295Z http.https://github.com/.extraheader 2025-12-04T11:07:59.5197303Z Entering 'third_party/tensorpipe' 2025-12-04T11:07:59.5215349Z http.https://github.com/.extraheader 2025-12-04T11:07:59.5251582Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T11:07:59.5289072Z http.https://github.com/.extraheader 2025-12-04T11:07:59.5322747Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T11:07:59.5350595Z http.https://github.com/.extraheader 2025-12-04T11:07:59.5390788Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T11:07:59.5414347Z http.https://github.com/.extraheader 2025-12-04T11:07:59.5451272Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T11:07:59.5478361Z http.https://github.com/.extraheader 2025-12-04T11:07:59.5505056Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T11:07:59.5530099Z http.https://github.com/.extraheader 2025-12-04T11:07:59.5600431Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.5628728Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T11:07:59.5886486Z Entering 'android/libs/fbjni' 2025-12-04T11:07:59.5907410Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T11:07:59.5934456Z Entering 'third_party/FP16' 2025-12-04T11:07:59.5955948Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T11:07:59.5975611Z Entering 'third_party/FXdiv' 2025-12-04T11:07:59.5995656Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T11:07:59.6016184Z Entering 'third_party/NNPACK' 2025-12-04T11:07:59.6036011Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T11:07:59.6052648Z Entering 'third_party/NVTX' 2025-12-04T11:07:59.6072746Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T11:07:59.6094817Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T11:07:59.6120144Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T11:07:59.6142457Z Entering 'third_party/XNNPACK' 2025-12-04T11:07:59.6164072Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T11:07:59.6196486Z Entering 'third_party/aiter' 2025-12-04T11:07:59.6212993Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T11:07:59.6233141Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T11:07:59.6263868Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T11:07:59.6296295Z Entering 'third_party/benchmark' 2025-12-04T11:07:59.6316894Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T11:07:59.6335099Z Entering 'third_party/composable_kernel' 2025-12-04T11:07:59.6360928Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T11:07:59.6391259Z Entering 'third_party/cpp-httplib' 2025-12-04T11:07:59.6405140Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T11:07:59.6422410Z Entering 'third_party/cpuinfo' 2025-12-04T11:07:59.6450448Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T11:07:59.6464338Z Entering 'third_party/cudnn_frontend' 2025-12-04T11:07:59.6490598Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T11:07:59.6514838Z Entering 'third_party/cutlass' 2025-12-04T11:07:59.6530892Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T11:07:59.6551583Z Entering 'third_party/fbgemm' 2025-12-04T11:07:59.6570770Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T11:07:59.6594875Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T11:07:59.6619775Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T11:07:59.6647625Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T11:07:59.6673084Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T11:07:59.6691748Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T11:07:59.6715811Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T11:07:59.6731647Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T11:07:59.6752675Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T11:07:59.6781560Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T11:07:59.6799482Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T11:07:59.6815365Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T11:07:59.6843237Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T11:07:59.6863071Z Entering 'third_party/fbgemm/external/json' 2025-12-04T11:07:59.6889295Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T11:07:59.6913377Z Entering 'third_party/flash-attention' 2025-12-04T11:07:59.6941320Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T11:07:59.6964790Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T11:07:59.6993024Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T11:07:59.7015525Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T11:07:59.7032225Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T11:07:59.7061965Z Entering 'third_party/flatbuffers' 2025-12-04T11:07:59.7080043Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T11:07:59.7100195Z Entering 'third_party/fmt' 2025-12-04T11:07:59.7114978Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T11:07:59.7135666Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T11:07:59.7161827Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T11:07:59.7176719Z Entering 'third_party/gloo' 2025-12-04T11:07:59.7202560Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T11:07:59.7224949Z Entering 'third_party/googletest' 2025-12-04T11:07:59.7250854Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T11:07:59.7268080Z Entering 'third_party/ideep' 2025-12-04T11:07:59.7283838Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T11:07:59.7305025Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T11:07:59.7332997Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T11:07:59.7361388Z Entering 'third_party/ittapi' 2025-12-04T11:07:59.7384950Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T11:07:59.7404916Z Entering 'third_party/kineto' 2025-12-04T11:07:59.7433945Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T11:07:59.7447489Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T11:07:59.7471417Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T11:07:59.7489242Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T11:07:59.7511753Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T11:07:59.7526126Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T11:07:59.7544234Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T11:07:59.7563771Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T11:07:59.7581871Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T11:07:59.7598160Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T11:07:59.7617101Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T11:07:59.7633940Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T11:07:59.7656153Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T11:07:59.7692367Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T11:07:59.7706159Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T11:07:59.7726258Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T11:07:59.7746613Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T11:07:59.7764604Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T11:07:59.7784370Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T11:07:59.7809429Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T11:07:59.7826530Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T11:07:59.7844330Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T11:07:59.7864369Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T11:07:59.7885142Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T11:07:59.7906374Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T11:07:59.7925529Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T11:07:59.7945707Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T11:07:59.7974349Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T11:07:59.7988231Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T11:07:59.8005237Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T11:07:59.8024160Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T11:07:59.8045148Z Entering 'third_party/kleidiai' 2025-12-04T11:07:59.8063932Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T11:07:59.8081893Z Entering 'third_party/mimalloc' 2025-12-04T11:07:59.8104696Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T11:07:59.8121102Z Entering 'third_party/nlohmann' 2025-12-04T11:07:59.8144030Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T11:07:59.8161713Z Entering 'third_party/onnx' 2025-12-04T11:07:59.8184600Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T11:07:59.8213270Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T11:07:59.8235128Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T11:07:59.8259999Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T11:07:59.8283929Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T11:07:59.8299722Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T11:07:59.8323044Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T11:07:59.8336683Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T11:07:59.8354731Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T11:07:59.8378454Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T11:07:59.8396720Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T11:07:59.8418441Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T11:07:59.8436750Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T11:07:59.8459593Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T11:07:59.8477500Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T11:07:59.8499096Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T11:07:59.8521404Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T11:07:59.8540712Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T11:07:59.8558034Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T11:07:59.8577567Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T11:07:59.8596507Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T11:07:59.8618853Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T11:07:59.8637018Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T11:07:59.8659864Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T11:07:59.8675368Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T11:07:59.8715862Z Entering 'third_party/pocketfft' 2025-12-04T11:07:59.8731667Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T11:07:59.8749779Z Entering 'third_party/protobuf' 2025-12-04T11:07:59.8769211Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T11:07:59.8802244Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T11:07:59.8824050Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T11:07:59.8840417Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T11:07:59.8862731Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T11:07:59.8883404Z Entering 'third_party/psimd' 2025-12-04T11:07:59.8905957Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T11:07:59.8924601Z Entering 'third_party/pthreadpool' 2025-12-04T11:07:59.8945929Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T11:07:59.8963116Z Entering 'third_party/pybind11' 2025-12-04T11:07:59.8988238Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T11:07:59.9001763Z Entering 'third_party/python-peachpy' 2025-12-04T11:07:59.9027793Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T11:07:59.9047894Z Entering 'third_party/sleef' 2025-12-04T11:07:59.9067315Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T11:07:59.9091417Z Entering 'third_party/tensorpipe' 2025-12-04T11:07:59.9115903Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T11:07:59.9136256Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T11:07:59.9167461Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T11:07:59.9187808Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T11:07:59.9215080Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T11:07:59.9240779Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T11:07:59.9265321Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T11:07:59.9279055Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T11:07:59.9297787Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T11:07:59.9319416Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T11:07:59.9337883Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T11:07:59.9387054Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9413819Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9438251Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9462164Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9486467Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9506705Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9531936Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9555637Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9581015Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9605059Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9637212Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9646579Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9675389Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9696800Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9725177Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9746550Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9766711Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9787801Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9816994Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9854173Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9880017Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9905112Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9930913Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9954339Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:07:59.9978401Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0015687Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0043338Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0069490Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0096062Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0125832Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0153977Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0178664Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0206722Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0235045Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0259450Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0287432Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0316365Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0344846Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0370694Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0391448Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0418748Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0448852Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0477123Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0506176Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0538349Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0565684Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0591060Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0616205Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0641766Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0667781Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0697297Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0724683Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0750256Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0776585Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0801432Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0827193Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0852419Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0872606Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0898584Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0922338Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0947381Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.0976647Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1000794Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1023550Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1050335Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1079817Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1106441Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1132450Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1154600Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1179711Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1208860Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1236315Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1260466Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1288402Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1316392Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1341043Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1369719Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1396887Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1421683Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1449813Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1477560Z [command]/usr/bin/git config --file /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T11:08:00.1661967Z Cleaning up orphan processes